GENE EXPRESSION IN BLADDER TUMORS



This application claims the benefit of U.S. Provisional Application 
No. 60/121,124, filed February 22, 1999, which is hereby incorporated by 
reference in its entirety. 

TECHNICAL FIELD OF THE INVENTION

This invention is related to the field of cancer diagnosis and treatment. 
In particular it is related to the use of gene expression to categorize and detect 
tumors. 

BACKGROUND OF THE INVENTION 

The building of large databases containing human genome sequences
is the basis for studies of gene expression in various tissues during normal
physiological and pathologic conditions. Constantly (constitutively) expressed
sequences as well as sequences whose expression is altered during disease
processes are important for our understanding of cellular properties, and for
the identification of candidate genes for future therapeutic intervention. As the
number of known genes and ESTs builds up in the databases, array-based
simultaneous screening of thousands of genes is necessary to obtain a profile
of transcriptional behaviour, and to identify key genes that, either alone or in
combination with other genes, control various aspects of cellular life. One

Still another object of the invention is to provide a method of
identifying a tissue sample as urothelial. 

Yet another object of the invention is to provide a method of determining
an expression pattern of a bladder tissue sample independent of the proportion
of submucosal, muscle, and connective tissue cells present.

These and other objects of the invention are achieved by providing 
one or more of the embodiments described below. In one embodiment a 
method is provided of determining an expression pattern of a cell sample 
independent of the proportion of submucosal, smooth muscle, or connective
tissue cells present. Expression is determined of one or more genes in a sample
comprising cells. The one or more genes exclude genes which are expressed 
in the submucosal, muscle, and connective tissue. A pattern of expression is 
formed for the sample which is independent of the proportion of submucosal, 
muscle, and connective tissue cells in the sample. 

In another aspect of the invention a method of determining an
expression pattern of a cell sample is provided. Expression is determined of
one or more genes in a sample comprising cells. A first pattern of expression 
is thereby formed for the sample. Genes which are expressed in submucosal, 
smooth muscle, or connective tissue cells are removed from the first pattern of
expression, forming a second pattern of expression which is independent of the
proportion of submucosal, smooth muscle, or connective tissue cells in the 
sample. 

Another embodiment of the invention provides a method for
determining an expression pattern of a urothelium or bladder cancer cell.
Expression is determined of one or more genes in a sample comprising
urothelium or bladder cancer cells; the expression determined forms a first
pattern of expression. A second pattern of expression, which was formed using
the one or more genes and a sample comprising predominantly submucosal,
smooth muscle, or connective tissue cells, is subtracted from the first pattern
of expression, forming a third pattern of expression. The third pattern of
expression reflects expression of the urothelium or bladder cancer cells
independent of the proportion of submucosal, smooth muscle, or connective
tissue cells present in the sample.

In another embodiment of the invention a method is provided of 
detecting an invasive tumor in a patient. A marker is detected in a sample of
a body fluid. The body fluid is selected from the group consisting of blood,
plasma, serum, urine, ascites fluid, pleural fluid, spinal fluid, sputum, and
mucous secretions. The marker is an mRNA or protein expression product of
a gene which is more prevalent in submucosal, smooth muscle, or connective
tissue than in the body fluid. An increased amount of the marker in the body
fluid indicates a tumor which has become invasive in the patient.

In another aspect of the invention a method is provided for diagnosing 
a bladder cancer. A first pattern of expression is determined of one or more 
genes in a bladder tissue sample suspected of being neoplastic. The first 
pattern of expression is compared to second and third reference patterns of
expression. The second pattern is of the one or more genes in normal
urothelium and the third pattern is of the one or more genes in bladder cancer.
A first pattern of expression which is found to be more similar to the third 
pattern than the second indicates neoplasia of the bladder tissue sample. 

According to yet another aspect of the invention a method is provided
for predicting outcome or prescribing treatment of a bladder tumor. A first
pattern of expression is determined of one or more genes in a bladder tumor 
sample. The first pattern is compared to one or more reference patterns of 
expression determined for bladder tumors at a grade between I and IV. The 
reference pattern which shares maximum similarity with the first pattern is
identified. The outcome or treatment appropriate for the grade of tumor of the
reference pattern with the maximum similarity is assigned to the bladder tumor 
sample. 

In another embodiment of the invention a method is provided for 
determining grade of a bladder tumor. A first pattern of expression is
determined of one or more genes in a bladder tumor sample. The first pattern
is compared to one or more reference patterns of expression determined for
bladder tumors at a grade between I and IV. The reference pattern which
shares maximum similarity with the first pattern is identified. The grade of the 
reference pattern with the maximum similarity is assigned to the bladder tumor 
sample. 

Yet another embodiment of the invention provides a method to
determine stage of a bladder tumor. A first pattern of expression is
determined of one or more genes in a bladder tumor sample. The first pattern
is compared to one or more reference patterns of expression determined for
bladder tumors at different stages. The reference pattern which shares
maximum similarity with the first pattern is identified. The stage of the
reference pattern with the maximum similarity is assigned to the bladder tumor 
sample. 

In still another embodiment of the invention a method is provided for 
identifying a tissue sample as urothelial. A first pattern of expression is
determined of one or more genes in a tissue sample. The first pattern of
expression is compared to a second pattern of expression obtained from normal
urothelial cells. Similarity between the first and second patterns identifies the
tissue sample as urothelial in its origin.

Another aspect of the invention is a method to identify a set of genes
useful for diagnosing, predicting outcome, or prescribing treatment of a bladder
cancer. A first pattern of expression is determined of one or more genes in a
first bladder tissue sample. A second pattern of expression is determined of the
one or more genes in a second bladder tissue sample. The first bladder tissue
sample is a normal urothelium sample or an earlier stage or lower grade of
bladder tumor than the second bladder tissue sample. The first pattern of
expression is compared to the second pattern of expression to identify a first
set of genes whose expression is increased or decreased in the second bladder
tissue sample relative to the first bladder tissue sample. Those genes which are
expressed in submucosal, smooth muscle or connective tissue are removed
from the first set of genes to produce a second set of genes. Expression of the
second set of genes can be used for diagnosing, predicting outcome, or
prescribing treatment of a bladder cancer.

According to yet another aspect of the invention a method is provided 
for determining an expression pattern of a bladder tissue sample independent
of the proportion of submucosal, smooth muscle, or connective tissue cells
present. A single-cell suspension of disaggregated bladder tumor cells is
isolated from a bladder tissue sample comprising bladder cells, submucosal
cells, smooth muscle cells, or connective tissue cells. The expression of one
or more genes in the single-cell suspension is determined. A pattern of
expression is thus formed for the sample which is independent of the
proportion of submucosal, smooth muscle, or connective tissue cells in the 
bladder tissue sample. 

According to still another aspect of the invention a method is provided 
for screening compounds to identify candidate therapeutic agents for treating
bladder cancer. Bladder tumor cells are contacted with a test compound.
Gene expression of one or more genes is determined in the bladder tumor cells
which have been contacted with the test compound. The one or more genes
are ones whose expression changes during the development of a bladder
cancer. A test compound is identified as a candidate therapeutic agent if it
causes gene expression of at least one of the one or more genes to change to
a level which is characteristic of an earlier stage of cancer progression. 

The present invention thus provides the art with numerous methods for 
molecularly assessing bladder cells. The methods aid the art in diagnosing,
identifying, classifying, detecting, and treating tumors of the bladder.

BRIEF DESCRIPTION OF THE DRAWINGS AND TABLES

Figure 1 shows a distribution of expression levels in bladder wall tissue 
expressed in arbitrary units. Only genes scored as present or marginally 
present are shown. 

Figure 2 shows a comparison of intensity differences between a tumor
and a pool of tumors of the same stage and grade, and between two pools of
different stages and grades. The difference is larger between the pools,
demonstrating the validity of using expression patterns to determine stage,
grade, prognosis, and treatment regimen.

All genes scored as present on two chips (approximately 9000 genes)
were compared. Increased genes were compared to increased genes and
decreased genes to decreased genes, on two separate chips, followed by
plotting of the numerical difference of the medians. A TaGrII tumor compared
to the TaGrII pool, and the TaGrII pool to T2GrIV pool (1, 2, 3, 4); T2GrIII
to T2GrIII pool, and T2GrIII pool to T2GrIV pool (5, 6, 7, 8); T2GrIV to
T2GrIV pool, and T2GrIV pool to TaGrII pool (9, 10, 11, 12). Paired t-test
of medians showed a borderline significant difference, with pool vs. pool
scoring highest (P-value of 0.07).

Figure 3 shows progression of a bladder cancer from normal urothelium 
to invasive grade IV tumor. The expression patterns change during the
progression, with a great variation in pattern from stage to stage, but also
within a stage and even within tumors having the same stage and grade of
atypia. 

Figure 4 shows the correlation between transcript levels from genes
expressed in at least one sample. Fig. 4A demonstrates the repeatability of
microarray expression analysis. Duplicate determinations on a normal sample
are compared. Fig. 4B is a plot of a pTa tumor vs. the normal pool. Fig. 4C
is a plot of an invasive pT2 tumor versus the normal pool. The vast majority
of transcripts are present at similar levels in both normal and tumor tissue.

Figure 5 shows dendrograms of tissues based on different clustering
methods. Clustering was either based on log-fold change in expression level
of genes (Figs. 5A, 5C), or the absolute difference (Figs. 5B, 5D), comparing
tumor to a pool of normal samples. Genes used for clustering were either
those 10% of the genes that covaried best with progression (A, B), or all 4076
genes that were scored as present in at least one sample (C, D).

Figures 6A through 6F show how the pattern of expression changes
during progression of bladder cancer based on levels of transcripts in pools of
normal biopsies, superficial pTa tumors grade II, and invasive pT2+ grade IV
tumors. The curve at the top left portion of each subfigure shows the direction
of change in gene expression based on pools of normal urothelium (open
circle), superficial pTa tumor (gray circle) and invasive pT2+ tumor (black
circle). Fold change in gene expression level was calculated on a probe-to-probe
basis using 20 probes per gene and eliminating the highest and lowest
outliers (olympic scoring). It is noteworthy that reduced expression is the most
common event.
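
The olympic scoring mentioned for Figure 6 can be illustrated with a minimal Python sketch. It assumes that the per-probe ratios are averaged after the single highest and single lowest ratio are dropped; the text does not state how the remaining ratios are combined, so the averaging step and the function name `olympic_fold_change` are illustrative assumptions only.

```python
def olympic_fold_change(test_probes, ref_probes):
    """Fold change for one gene from paired probe intensities.

    Assumption: ratios are computed probe by probe, the highest and
    lowest ratios are discarded, and the rest are averaged.
    """
    if len(test_probes) != len(ref_probes):
        raise ValueError("probe lists must have the same length")
    if len(test_probes) < 3:
        raise ValueError("need at least three probes to trim outliers")
    # Probe-to-probe ratios, sorted so the extremes sit at both ends.
    ratios = sorted(t / r for t, r in zip(test_probes, ref_probes))
    trimmed = ratios[1:-1]  # drop the lowest and the highest ratio
    return sum(trimmed) / len(trimmed)
```

With 20 probes per gene, as described above, 18 ratios would remain after trimming.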

Figure 7 shows a cluster diagram of 9 bladder tumors representing the
progression of bladder cancer. Each column represents a tumor preparation,
and each row a gene. The diagrams show clustering based on log-fold change
from normal urothelium (left diagram) and based on absolute difference from
normal urothelium (right diagram). A decreased expression is displayed as
shades of cyan, an increased expression as shades of yellow, and no change as
black.

The dendrograms at each side show the relation between the different 
genes. In the middle, distinct functional clusters are identified and members of 
the clusters are annotated in brief (for full length ID of all genes in the diagram
and Genbank numbers see www.mdl.dk/supplementary data). In an effort to
identify those genes most indicative of cancer progression a weighting scheme
was used to select the 400 genes that covaried best with the different stages of 
bladder cancer. Gene clustering was based on normalized Euclidean distance 
(vector angle) calculated between genes or gene cluster centers. 

Figure 8 shows the vector angle between pools and individual single
cell preparations or biopsies. The numbers refer to patient samples and the
stage of each is indicated. Pools are identified as follows: squares, Ta grade
I pool; diamonds, Ta grade II pool; circles, T2-4 grade III pool; triangles, T2-4
grade IV pool. The lowest angle for each sample determines whether the
sample is classified as a superficial (Ta or T1) or muscle-invasive tumor
(T1-T4).

Figure 9 shows a comparison of Northern blots and oligonucleotide
arrays. The samples analyzed were normal pool (Norm), superficial pTaGrI
tumor (335), minimally invasive pT1 grade III (901), and invasive pT2 grade
III (713). The Northern blots were scanned by densitometry and plotted (solid
lines) together with a plot of the level detected on the arrays (dotted lines).
The levels of expression ranged from 3-6,000 units (beta-2 microglobulin) to
100-600 units (E-cadherin). The level of transcripts detected was similar with
both methods.

Figure 10 shows immunohistochemical staining of the tissue sections
used for expression analysis. On each section the protein examined is indicated
along with the level measured on the oligonucleotide array. Arrows indicate
stained urothelial cells in cathepsin E (297 arbitrary units), ApoE (389 units),
and CD59 (260 units) stainings, and stained stromal cells or leukocytes in
beta2-microglobulin (2481 units) and cystatin C (941 units).

Figure 11 presents a model of gene expression events during the
progression of bladder cancer. The top of the figure shows the stages of
bladder cancer, and the lower part shows the sequence of transcriptional
events. The color cyan identifies reduced expression, yellow increased
expression (also indicated by arrows). The figure is based on data from cluster
analysis, and combines the different cluster methods.

Table 1 shows genes which were highly expressed in bladder wall. 
Expression is shown in "connective tissue" which includes muscle and 
submucosal cells, a TaGr III, a T2 Gr III, and a T2 Gr IV bladder tumor. Genes
above the 90th percentile are grouped according to the purported function of
the protein.

Table 2A shows high intensity genes in bladder wall compared to single 
cell solutions and biopsies of tumors. 

Table 2B shows expression of genes related to bladder wall.

Table 3 shows the number of genes that are expressed as in the tumor
pool to which the tumor belongs, or altered as in a tumor pool of higher or
lower stage or grade.

Several of the tables described above contain lists which include both
genes and expressed sequence tags (ESTs). Reference to the "genes" of a table 
shall be understood to include the gene containing the EST. 

DETAILED DESCRIPTION OF THE INVENTION

It is a discovery of the present inventor that characteristic patterns of
expression of genes can be used to characterize different types of tissue. Thus,
for example, gene expression patterns can be used to characterize stages and
grades of bladder tumors. Similarly, gene expression patterns can be used to
distinguish cells having a bladder origin from other cells. Moreover, gene
expression of cells which routinely contaminate bladder tumor biopsies has
been identified, and such gene expression can be removed or subtracted from
patterns obtained from bladder biopsies. Further, the gene expression patterns
of single-cell solutions of bladder tumor cells have been found to be far freer
of interfering expression of contaminating muscle, submucosal, and connective
tissue cells than biopsy samples.

Working with human tumor material requires biopsies, and working
with RNA requires freshly frozen or immediately processed biopsies. Biopsies
inevitably contain many different cell types in addition to cancer cells, such as
cells present in blood, connective and muscle tissue, endothelium etc. In the
case of DNA studies, microdissection or laser capture are methods of choice;
however, the time-dependent degradation of RNA makes it difficult to perform
manipulation of the tissue for more than a few minutes. Furthermore, studies
of expressed sequences may be difficult on the few cells obtained via
microdissection or laser capture, as these may have an expression pattern that
deviates from the predominant pattern in a tumor due to intratumoral
heterogeneity.

High-density expression arrays were used to evaluate the impact of
bladder wall components in bladder tumor biopsies, and preparation of
single cell solutions was tested as a means of eliminating the contaminants.
The results of these evaluations permitted the design of methods of evaluating
bladder samples without the interfering background noise caused by ubiquitous
contaminating submucosal, muscle, and connective tissue cells.

The evaluating assays of the invention may be of any type. While high- 
density expression arrays can be used, other techniques are also contemplated. 
These include other techniques for assaying for specific mRNA species, 
including RT-PCR and Northern Blotting, as well as techniques for assaying 
for particular protein products, such as ELISA, Western Blotting, and enzyme 
assays. Gene expression patterns according to the present invention are 
determined by measuring a gene product of a particular gene, including mRNA 
and protein. A pattern may be for one or more genes. 

Using the results provided in the accompanying figures and tables, a 
gene is indicated as being expressed if an intensity value of greater than or 
equal to 21 is shown. Conversely, an intensity value of less than 21 indicates 
that the gene is not expressed above background levels. Comparison of an 
expression pattern to another may score a change from expressed to non- 
expressed, or the reverse. Alternatively, changes in intensity of expression may 
be scored, either increases or decreases. Any statistically significant change 
can be used. Typically changes which are greater than 2-fold are suitable. 
Changes which are greater than 3-fold or 5-fold are highly significant.
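
The scoring rules in the preceding paragraph can be sketched in Python as follows. The cutoff of 21 and the 2-fold default come from the text; the function names and the returned labels are hypothetical, chosen only for illustration, and the sketch is not presented as the scoring performed by the array analysis software.

```python
PRESENT_THRESHOLD = 21  # intensity cutoff stated in the text

def is_expressed(intensity):
    """A gene counts as expressed at an intensity of 21 or more."""
    return intensity >= PRESENT_THRESHOLD

def score_change(reference, test, min_fold=2.0):
    """Label the change in one gene between a reference and a test sample.

    The labels are illustrative; min_fold may be raised to 3 or 5 for
    the more stringent cutoffs mentioned in the text.
    """
    ref_on, test_on = is_expressed(reference), is_expressed(test)
    if not ref_on and not test_on:
        return "not expressed"
    if ref_on != test_on:
        # Change from non-expressed to expressed, or the reverse.
        return "gained" if test_on else "lost"
    fold = test / reference  # both values are >= 21 here
    if fold > min_fold:
        return "increased"
    if fold < 1.0 / min_fold:
        return "decreased"
    return "unchanged"
```

For example, `score_change(100, 250)` labels a 2.5-fold rise as increased, while `score_change(100, 250, min_fold=3.0)` does not.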

A pattern of characteristic expression of just one gene can be useful in 
characterizing a cell type source or a stage of disease. However, more genes 
may be usefully analyzed. Useful patterns include expression of at least one, 
two, three, five, ten, fifteen, twenty, twenty-five, fifty, seventy-five, or one 
hundred informative genes. As used herein, the phrase "stage-specific 
reference pattern" refers to a pattern of gene expression characteristic of a 
given stage of progression in a bladder tumor. A stage-specific reference 
pattern can include one or more genes listed in Table 4 and/or one or more 
genes listed in Table 8 and/or one or more genes listed in Table 9 and/or one 
or more genes listed in Fig. 6. 

RNA or protein can be isolated and assayed from a test sample using 
any techniques known in the art. They can, for example, be isolated from fresh
or frozen biopsy, from formalin-fixed tissue, or from body fluids, such as blood,
plasma, serum, urine, or sputum.

Stage of a bladder tumor indicates how deeply the tumor has 
penetrated. Superficial tumors are termed Ta, and T1-4 are used to describe
increasing degrees of penetration into the muscle. The grade of a bladder
tumor is expressed on a scale of I-IV (1-4). The grade reflects the cytological 
appearance of the cells. Grade I cells are almost normal. Grade II cells are 
slightly deviant. Grade III cells are clearly abnormal. And Grade IV cells are 
highly abnormal. 

As used herein, the reference to genes which are expressed in
"submucosal, smooth muscle, or connective tissue" or patterns of expression
in "other cell types" can include the expression of one or more of the genes
listed in Table 1 and/or one or more of the genes listed in Table 6. The term
"connective tissue cell" includes any stromal cell such as fibroblasts,
macrophages, mast cells, granulocytes, etc. The data provided herein of
expression for submucosal, smooth muscle, and connective tissue can be used
in at least three ways to improve the quality of data for a tested sample. The
genes identified in the data as expressed can be excluded from the testing
altogether or tested but eliminated from the analysis. Alternatively, the
intensity of expression of the genes expressed in the submucosal, smooth
muscle, and/or connective tissue can be subtracted from the intensity of
expression determined for the test tissue.
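
The exclusion and subtraction options described above might be sketched in Python as below. The gene identifiers, the function names, and the choice to floor subtracted intensities at zero are assumptions made for illustration; the text does not specify how negative differences are handled.

```python
def exclude_genes(sample, wall_genes):
    """Drop genes known to be expressed in bladder wall tissue.

    sample: dict mapping gene identifier -> intensity.
    wall_genes: collection of gene identifiers to eliminate.
    """
    return {gene: value for gene, value in sample.items()
            if gene not in wall_genes}

def subtract_background(sample, wall_profile, floor=0.0):
    """Subtract bladder wall intensities gene by gene.

    Assumption: results below `floor` are clipped to `floor`.
    """
    return {gene: max(value - wall_profile.get(gene, 0.0), floor)
            for gene, value in sample.items()}

# Hypothetical intensities in arbitrary units.
sample = {"GENE_A": 500.0, "GENE_B": 120.0, "GENE_C": 40.0}
wall = {"GENE_B": 100.0, "GENE_C": 90.0}
filtered = exclude_genes(sample, wall)
corrected = subtract_background(sample, wall)
```

Exclusion discards the affected genes entirely, whereas subtraction keeps them in the pattern with a corrected intensity.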

Patterns can be compared manually (by a person) or by a computer or
other machine. An algorithm can be used to detect similarities and differences.
The algorithm may score and compare, for example, the genes which are
expressed and the genes which are not expressed. Alternatively, the algorithm
may look for changes in intensity of expression of a particular gene and score
changes in intensity between two samples. A variety of such algorithms are
known in the art. Similarities may be determined on the basis of genes which
are expressed in both samples and genes which are not expressed in both
samples or on the basis of genes whose intensities of expression are numerically
similar. Differences are considered significant when they are greater than 2-fold,
3-fold or 5-fold from the base value. Alternatively, a mathematical
approach can be used to conclude whether differences in the gene expression
exhibited by different samples are significant (see, e.g., Golub et al., Science 286,
531 (1999)). One approach to determine whether a sample is more similar to
or has maximum similarity with a given condition (e.g., a particular grade or
stage of tumor progression) is to compare the Euclidean distances (see Golub
et al. and Example 6) between the sample and one or more pools representing
different conditions for comparison; the pool with the smallest vector angle is
then chosen as the most similar to the test sample among the pools compared.
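
The selection of the most similar pool by smallest vector angle can be sketched in Python as follows. The sketch assumes that each sample and each pool is represented as a list of per-gene values in the same gene order (for instance, changes relative to normal urothelium); the pool names, the values, and the function names are hypothetical.

```python
import math

def vector_angle(a, b):
    """Angle in degrees between two expression vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("angle is undefined for an all-zero vector")
    # Clamp to [-1, 1] to guard against floating-point rounding.
    cosine = max(-1.0, min(1.0, dot / (norm_a * norm_b)))
    return math.degrees(math.acos(cosine))

def most_similar_pool(sample, pools):
    """Return the name of the pool with the smallest angle to the sample."""
    return min(pools, key=lambda name: vector_angle(sample, pools[name]))

# Hypothetical reference pools and test sample.
pools = {
    "Ta grade II": [1.0, 0.0, 0.5],
    "T2-4 grade IV": [-1.0, 2.0, 0.0],
}
assigned = most_similar_pool([0.9, 0.1, 0.4], pools)
```

Because the angle depends only on the direction of each vector, two profiles that differ by a constant scaling factor are treated as identical under this measure.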

The data collected and disclosed here as "connective tissue" is 
presumed to contain both smooth muscle and submucosal gene expression as 
well. Thus it represents the composite expression of those cell types which can 
typically contaminate a bladder tumor biopsy. 

Genes identified as changing in various stages or grades of bladder
cancer can be used as markers for drug screening. Thus by treating bladder
cancer cells with test compounds or extracts, and monitoring the expression
of genes identified as changing in the progression of bladder cancers, one can
identify compounds or extracts which change expression of genes to a pattern
which is of an earlier stage/grade or even of normal urothelium. As used
herein, the reference to expression of genes in "normal urothelium" or "normal
urothelial cells" can include one or more genes listed in Table 7.

As demonstrated below, pools of tumors of a similar stage or grade,
particularly bladder tumors, can be made and the expression of the pool
evaluated. The expression data of the pool can be used to define a stage or
grade of tumor. The use of the pool minimizes the variations found from
individual tumor to individual tumor of the same grade or stage. The
expression data of the pool can then be used as a comparator to which
individual tumor samples are compared, in an effort to categorize the tumor
samples, predict their outcome, and prescribe treatment. The methods described
herein for classifying the stage or grade of a tumor can be combined with sequence
analysis of genes whose expression is altered compared to normal tissue in the
individual patient. In particular, mutations in key genes such as tumor
suppressor genes can help to refine the application of the gene expression
results to diagnosis and prognosis.

As used herein, the reference to expression of "genes in bladder cancer" 
or "genes in a bladder tumor" can include one or more genes listed in Table 4 
and/or one or more of the genes listed in Table 8 and/or one or more genes 
listed in Table 9. 

The above disclosure generally describes the present invention. A more 
complete understanding can be obtained by reference to the following specific 
examples which are provided herein for purposes of illustration only, and are 
not intended to limit the scope of the invention. 

EXAMPLE 1 

Quantification of gene expression using microarrays 

Material 

Bladder tumor biopsies were sampled from patients after informed 
consent was obtained, and after removal of the necessary amount of tissue for 
routine pathological examination. Tumors examined were 335 (stage pTa
grade I), 837 (pTa GrII), 901 (pTa GrIII), 320 (pT1 GrIII), 713 (pT2 GrIII).
RNA from six different tumors of the same stage and grade was combined to
form each pool. Four such pools were prepared (pTa GrI pool, pTa GrII pool,
pT2+ GrIII pool, and pT2+ GrIV pool). Normal bladder mucosa biopsies from
36 patients with prostatic hyperplasia or incontinence were pooled (as RNA) 
to obtain a normal urothelial reference. Single cell solutions were made by 
disintegrating biopsies on ice with a scalpel and a syringe followed by filtering 
through a 50 micron filter. 

Preparation of mRNA

Total RNA was isolated using the RNAzol B RNA isolation method
(WAK-Chemie Medical GMBH). Poly(A)+ RNA was isolated by an oligo-dT
selection step (Oligotex mRNA kit from Qiagen).

Preparation of cRNA

One ng mRNA was used as starting material for the cDNA preparation. 
The first and second strand cDNA synthesis was performed using the 
Superscript Choice System (Life Technologies) according to the
manufacturer's instructions, except that an oligo-dT primer containing a T7
RNA polymerase promoter site was used. Labeled cRNA was prepared using
the MEGAscript In Vitro Transcription kit (Ambion). Biotin-labeled CTP and
UTP (Enzo) were used in the reaction together with unlabeled NTPs.
Following the IVT reaction, the unincorporated nucleotides were removed
using RNeasy columns (Qiagen).
Array hybridization and scanning

Ten µg of cRNA was fragmented at 94°C for 35 min in a
fragmentation buffer containing 40 mM Tris-acetate pH 8.1, 100 mM KOAc, 30
mM MgOAc. Prior to hybridization, the fragmented cRNA in a 6xSSPE-T
hybridization buffer (1 M NaCl, 10 mM Tris pH 7.6, 0.005% Triton) was heated
to 95°C for 5 min and subsequently to 40°C for 5 min before loading onto an
Affymetrix probe array cartridge (HuGeneFL set array, part No. V900160).
The probe array was then incubated for 16 h at 40°C at constant rotation (60
rpm). The washing and staining procedure was performed in the Affymetrix
Fluidics Station. The probe array was exposed to 10 washes in 6xSSPE-T at
25°C followed by 4 washes in 0.5xSSPE-T at 50°C. The biotinylated cRNA
was stained with a streptavidin-phycoerythrin conjugate, 10 ng/ml (Molecular
Probes, Eugene, OR) in 6xSSPE-T for 30 min at 25°C followed by 10 washes
in 6xSSPE-T at 25°C. The probe arrays were scanned at 560 nm using a
confocal laser scanning microscope with an argon ion laser as the excitation
source (made for Affymetrix by Molecular Dynamics). The readings from the
quantitative scanning were analyzed by the Affymetrix Gene Expression
Analysis Software.
Normalisation of data

To compare samples, normalization of the data was necessary. For that
purpose we compared scaling to total GAPDH intensity (sum of 3', middle,
5' probe sets) of 7000 units with scaling to a total chip intensity (global scaling)
of 281850 units (averaging 150 units per probe set). Both gave similar results
with scaling factors that differed less than ten percent in a set of experiments.
Based on this we chose the global scaling for all experiments. The variation in
hybridization intensity following global scaling in biopsies from the bladder wall
is shown in Fig. 1.
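
Global scaling of the kind described above can be illustrated with a short Python sketch. It assumes that a single multiplicative factor is applied to every probe set so that the mean intensity on the chip equals the target of 150 units; the function names and the example intensities are hypothetical and do not reproduce the Affymetrix software.

```python
def global_scaling_factor(intensities, target_mean=150.0):
    """Factor that brings the mean probe set intensity to target_mean."""
    total = sum(intensities)
    if total <= 0:
        raise ValueError("total intensity must be positive")
    return target_mean * len(intensities) / total

def scale_chip(intensities, target_mean=150.0):
    """Apply the global scaling factor to every probe set on one chip."""
    factor = global_scaling_factor(intensities, target_mean)
    return [value * factor for value in intensities]

# Hypothetical raw intensities for three probe sets.
raw = [100.0, 200.0, 300.0]
scaled = scale_chip(raw)
```

After scaling, the chip total equals the target mean multiplied by the number of probe sets, which is what makes intensities comparable between chips.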

EXAMPLE 2 

Interference of bladder wall components on expression profiling of bladder 
tumor biopsies 

Biopsies contain epithelial cells that most often are the targets for the 
studies (e.g., in the identification and characterization of carcinoma cells), and 
in addition many other cells that contaminate the epithelial cell fraction to a 
varying extent. The contaminants include histiocytes, endothelial cells, 
leukocytes, nerve cells, muscle cells etc. Microdissection is the method of 
choice for DNA examination, but in case of expression studies this procedure 
is difficult due to RNA degradation during the procedure. Our approach has 
been to gently remove the epithelium and monitor the expression in the 
remaining submucosa and underlying connective tissue (the bladder wall). 
Genes expressed at high or low levels in the bladder wall should be 
interrogated when performing expression monitoring of the urothelium and 
urothelial tumors. A similar approach could be used for studies of epithelia in 
other organs. 

We gently scraped off some of the normal urothelium lining the bladder 
lumen from bladders removed at cystectomy for bladder cancer. Then biopsies 
were taken from the denuded submucosa and connective tissue, reaching 
approximately 5 mm into the bladder wall, and immediately disintegrated in 
guanidinium isothiocyanate. Total RNA was extracted from four different 
cystectomy specimens, pooled, and poly(A)+ mRNA was prepared from the
pool followed by conversion to double-stranded cDNA and in vitro
transcription into cRNA containing biotin-labeled CTP and UTP.

The labeled sample was hybridized to a set of 4 arrays containing 7074
probe sets for human genes. A total of 1491 of the examined genes (21.1%)
were scored as present, and 120 (1.7%) as present but rare. The percentile
distribution of the expression intensity was (90%, 1308; 75%, 383; 50%, 163;
25%, 85; 10%, 47). Genes above the 90th percentile (Table 1) were grouped
according to the purported function of the protein (Table 1, first column).
Many of the highly expressed genes belong to a group of genes that encode
proteins involved in transcription and translation, probably reflecting that these
genes generally are highly expressed in the various cell types present in the
bladder wall, and corresponding to recent data on yeast. Structural proteins
such as keratins and proline rich proteins are highly expressed whereas collagen
genes are only medium expressed. Extremely high expression is shown by the
cystic fibrosis antigen gene, the S100 calcium binding protein, the cystatin B
and the cytokeratin 13 genes, which are all above 10,000 units.

To evaluate the influence of bladder wall tissue in urothelial tumor
biopsies, we monitored the expression level in 3 biopsies from transitional cell
carcinomas (one superficially invasive (#733-2) and two muscle invasive
(#733-1 and #879-1)). The expression intensities in the tumor biopsies of genes
that are highly expressed in bladder wall are listed in Table 1. Many genes are
expressed to the same magnitude in the tumor biopsies as in bladder wall: 82
genes (5%) were present at a level above 1308 in all samples, and 210 genes were
expressed above the 75th percentile of the bladder wall sample intensity (383
units) in all three biopsies as well as in bladder wall. Genes that were not
expressed in bladder wall but present in the urothelial biopsies amounted to
196.

Genes that are expressed and genes that are not expressed in bladder
wall can both interfere with the interpretation of the expression in a biopsy, and
should be interrogated when interpreting expression intensities in urothelial
tumor biopsies, as the bladder wall component of a biopsy varies in amount
from biopsy to biopsy.

It is remarkable how similarly many genes belonging to the groups
encoding metabolically active proteins, transcription and translation related
proteins, mitochondrial and nucleoproteins, are expressed in the different
samples (Table 1). It seems reasonable to expect that it is incompatible with
cellular function to stray from a narrow interval regarding these genes.
Although some of the examined cells are malignant with atypia grade IV, which
is a severe morphological deviation from normal, the key cellular functions are
obviously still under strict control.

Twenty-six genes were expressed at an intensity above 1308 in bladder
wall and more than five times lower in tumor biopsies. These genes, marked
in bold (Table 1), include keratins (7 genes) encoding proteins like keratins
type II, 4, and 6. Another prominent group are the genes encoding proline-rich
proteins (5 genes). The expression of these genes can be used to monitor the
amount of bladder wall present in a given tumor biopsy. The tumor biopsy 879-1
obviously has a larger bladder wall component than the other biopsies, as it
contains keratin 13 and several other highly expressed bladder wall mRNAs at
a low level, but higher than the other specimens (Table 1).

An interesting result was the S100 calcium binding protein A7 gene
transcript that was highly expressed in bladder wall and totally absent from the
other biopsies (also absent from a number of other examined tumor biopsies
from bladder). As all samples were collected with the same procedures, this
indicates that the expression is either individual and occurred by chance in the
patients from whom we removed the bladder wall biopsies, or, more likely, that
urothelial RNases degrade this transcript very fast.

Bladder tumors have a reduced intercellular cohesion, and easily 
disintegrate into single cell solutions. To eliminate bladder wall cells from the 
urothelial tumor cells, five tumors were disintegrated into single cell solutions 
before extraction of RNA, and compared to three tumors where RNA was 
extracted from the biopsy directly. We expected that this disintegration 
procedure might lead to an enrichment of tumor cells and loss of connective 
tissue cells. Examination of genes highly expressed in bladder wall (Table 2A)
showed a similar expression in single cell solutions of bladder tumor cells
compared to biopsies. However, the level was much lower than seen in the
bladder wall and raised the question whether the expressed RNAs originate
from the bladder wall or from the urothelial cells.

To answer this question we examined the expression of genes expected
to be present in bladder wall (Table 2B). Some of these genes were expressed
in the bladder tumor samples, and probably indicate the presence of bladder
wall components in these. It was striking that the single cell solutions contained
much lower expression levels of these genes compared to the biopsies
(p<0.004). Although the number of examined tumors was small, this indicates
that preparation of single cell solutions may reduce the presence of bladder wall
cells in the samples. The absence of keratin 8 in the bladder wall sample
demonstrated that this sample was devoid of urothelial cells (Table 2B).

The genes known to be related to the bladder wall components showed
a variable level throughout the samples. Some genes, like the myosin light chain 2
gene, the fibroblast tropomyosin gene and the alpha-1 collagen type IV gene, were
generally more highly expressed in the tumor samples than the other genes (Table
2B). We hypothesize that this may reflect a differential expression
of genes in the connective tissue component that inevitably is included in a
tumor biopsy, an expression that may deviate from the one found in the bladder
wall further away from the tumor. An example of this differential expression
in bladder wall was the presence of transcript from the myosin light chain gene
in the tumor samples but not in the bladder wall biopsies (Table 2B).

In tumors many important events take place in the non-epithelial
compartment. Tumors need connective tissue support, they need blood
vessels, they interact with the immune system and have intercellular signaling
with various sorts of other cells in a complicated way that has not yet been
clarified in detail. Gene expression originating from the non-epithelial
compartment contributes to the expression profile of a tumor, and might be of
great importance in relation to the clinical outcome and therapeutic response
of the tumor.

EXAMPLE 3

Expression profiling of tumor pools to assess individual tumor heterogeneity. 

We expected tumors from the same pathology stage and grade to have
a more similar gene expression than tumors from different stages and grades.
This was the case; however, these differences were not striking, and there were
exceptions. Based on this we hypothesized that a given tumor scored by a
pathologist at light microscopy to be similar to another one may actually
have a remarkably different gene expression. The reason for
this could be that tumors do not progress from one well defined stage to another
well defined stage, but rather that a continuous change is taking place at the
expression level, which is then only partly reflected by morphology.

We tested this hypothesis by examining the expression which differed
between a pool of tumors from a given stage and a single tumor, to see
whether the differentially expressed genes were linked to a certain stage or
whether they were a random deviation. It was evident that tumors whose expression
deviates from the pool systematically have expressions belonging to either
a lower or a higher stage, or both, or unique expressions not seen in the pool
(Fig. 3). The low stage Ta tumor had 43 genes whose expressions were
increased or decreased similar to the ones seen in a stage 2 grade IV tumor
pool, and only three expressions that were altered in the opposite direction of
stage 2 grade IV. The stage 2 grade IV tumor had 33 genes whose expression
was either increased or decreased similar to the ones seen in the Ta superficial
tumor pool, and only 8 genes altered in the opposite direction. In the T2 grade
III tumor, gene expressions that were increased or decreased similar to the
superficial or the grade IV invasive pool could be found. Furthermore, some
gene expressions were only low in this grade III tumor, and higher in both
superficial and grade IV tumor pools. The clinical information on the examined
single tumors (Table 3) paralleled the expression findings, as the intermediate
grade III tumor was the first muscle invasive tumor in a patient who had had
a superficial tumor five months earlier. It seems that this tumor has not reached
the level of malignancy seen in the other invasive tumor. The latter was of
grade IV, and was a big solid tumor with muscle invasion at first visit. The
superficial Ta tumor was the fifth recurrence and was followed by two new
recurrences 64 and 159 days later, also of superficial nature.

The genes that were identified in lower stage and grade tumors and 
shown to be similar to the expression in high stage tumors are listed in Tables 
4A and 4B. These genes may "signal" a higher stage or grade, or represent a 
transition from low stage or grade to high stage or grade. To aid in avoiding 
interpreting bladder wall expression as tumor cell-specific expression, the 
expression level in bladder wall is listed in Tables 4A and 4B. Two columns
are shown which simulate an increase in bladder wall content to 20% and 50%
of the sample. These columns were obtained by adding a 20% or 50%
contribution from the "bladder wall" column to the appropriate remaining
percentage contribution (80% or 50%) from the TaGrII Pool column. Single
tumor expression level (column labeled "Ta single tumor") was interrogated in 
this context. Expression levels which are unlikely to be due to bladder wall 
contamination are shown in bold; other expressions are shown in regular font. 
We believe this procedure is useful and leads to reliable conclusions. 
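
The simulated columns described above amount to a linear mixture of two expression levels, which can be sketched as follows. The mixing formula follows the text; the function names, the example intensities, and in particular the rule used to flag a level as unlikely to stem from bladder wall contamination are assumptions made for illustration, since the text does not state the exact criterion behind the bold entries.

```python
def simulate_bladder_wall_content(pool_level, wall_level, wall_fraction):
    """Mix wall_fraction of the bladder wall level with the remainder of the pool level."""
    if not 0.0 <= wall_fraction <= 1.0:
        raise ValueError("wall_fraction must lie between 0 and 1")
    return wall_fraction * wall_level + (1.0 - wall_fraction) * pool_level


def outside_simulated_range(single_tumor_level, pool_level, wall_level,
                            fractions=(0.2, 0.5)):
    """Assumed rule: flag a single-tumor level lying outside the range spanned
    by the pool level and the simulated 20% and 50% bladder wall mixtures."""
    levels = [pool_level] + [
        simulate_bladder_wall_content(pool_level, wall_level, f) for f in fractions
    ]
    return single_tumor_level < min(levels) or single_tumor_level > max(levels)


# Hypothetical intensities for one gene.
pool, wall = 100.0, 1000.0
mix_20 = simulate_bladder_wall_content(pool, wall, 0.2)  # 20% wall, 80% pool
mix_50 = simulate_bladder_wall_content(pool, wall, 0.5)  # 50% wall, 50% pool
```

With these toy values a single-tumor level of 300 units could be explained by bladder wall content, whereas a level of 900 units could not.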

These genes form a complex group of genes with highly different 
functions. It is not totally unexpected that mucin synthesis is changed, nor that
cytokeratin 15 is decreased when moving from Ta to higher stage. The
gene expressions which signal a higher grade of atypia in already invasive grade 
III tumors are, among others, immunology related genes. This may indicate 
that the more atypical cells are either surrounded by inflammatory cells, or that 
the tumor cells start synthesizing these proteins. Further investigation is needed 
to elucidate this point, and these proteins will be an interesting parameter to 
follow in relation to clinical course in the future. The strong up-regulation of 
cathepsin B may indicate an increased proteolytic attack against the connective 
tissue. 

The cause of the changed expression is unknown and could be either 
a transcriptional regulation or secondary to gain or loss of chromosome 
material. Both mechanisms are known to occur in cancer cells.

EXAMPLE 4

Change of transcript level during the progression of bladder cancer. 

Biopsies from human bladder tumors were analyzed as single tumors
or as pools of tumors representing the different stages in the progression of
bladder cancer. We used a total of 5 single tumors and 4 tumor pools,
each pool made by combining six tumors. To generate a normal reference
material, we pooled biopsies from normal bladder mucosa from 35 volunteers.
The biopsies were disintegrated into single cell solutions immediately after
removal, filtered and snap frozen in guanidinium isothiocyanate. From the cell
solutions RNA was extracted and reverse transcribed to cDNA, and the cDNA was
transcribed into labelled cRNA that was incubated on the chip cartridges,
followed by scanning and scaling to a global chip intensity amounting to 150
units per probe set. The scaling made it possible to compare individual
experiments to each other. To verify the reproducibility, double determinations
were made in selected cases and showed a good correlation (Fig. 4A).

We compared gene expression at three different steps in the
progression of bladder cancer to each other by the use of the normal pool as
a reference. A scatter plot of the noninvasive pTa grade one tumor and the
invasive highly abnormal grade four pT2+ tumor showed a minor subfraction
of the gene transcripts to deviate much from those in the normal urothelium.
The large majority of transcripts were within a narrow range in both tumors
and normal urothelium (Fig. 4B, C). The number of deviating genes was higher
in the most abnormal tumor.

We then analyzed transcripts that showed alterations larger than five-fold
when comparing three different pools representing the transition from
normal urothelium to superficial tumor, and further on to invasive transitional
cell carcinomas (TCC). The method applied consisted of a probe-to-probe
comparison (20 probes per gene) based on the software GeneChip® Analysis
Suite 3.1 from Affymetrix, Inc. Increased levels indicate that the transcript is
either upregulated at the stated level or turned on de novo, reaching a given fold
above the background level. Decreased levels in a similar way indicate
reduction or loss of transcript. Alterations of a single transcript during the
progression of bladder cancer can follow several different pathways
(Fig. 6). Some of the transcript changes reflect the transition from normal
cells to tumor cells, and are grouped as TCC related genes (Fig. 6A, B). A
distinct feature of group A was the presence of 6 smooth muscle related genes.
Others are altered only in superficial tumors, not in invasive tumors, and are
grouped as Bladder papilloma related genes (Fig. 6C, D). Group C, with
downregulated genes, contained 15 immunology-related genes. Group D
contained a variety of genes encoding proteins with different functions. Finally,
some genes only showed an alteration in invasive tumors and are grouped as
Invasive TCC related genes (Fig. 6E, F). The genes in group E encoded
functionally unrelated proteins, whereas group F contained 5 immunology-related
genes. Thus, it seemed possible to define groups of genes whose
expression level is associated with the stage of bladder tumors.
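
The grouping of transcripts by the stage at which they change can be illustrated with a short sketch. It is not the probe-to-probe comparison of the GeneChip software: as a stated simplification it uses a plain ratio of scaled intensities, compares each tumor pool to the normal pool, and borrows a floor of 20 units from Example 5. The function names and example intensities are hypothetical; only the five-fold threshold and the three group names come from the text.

```python
def fold_change(sample, reference, floor=20.0):
    """Signed fold change; intensities below the floor are raised to the floor."""
    s, r = max(sample, floor), max(reference, floor)
    return s / r if s >= r else -(r / s)


def call(sample, reference, threshold=5.0):
    """Call a transcript 'up', 'down' or 'unchanged' relative to the reference."""
    fc = fold_change(sample, reference)
    if fc > threshold:
        return "up"
    if fc < -threshold:
        return "down"
    return "unchanged"


def progression_group(normal, superficial, invasive):
    """Assign a transcript to a group from its levels in the three pools."""
    sup = call(superficial, normal)
    inv = call(invasive, normal)
    if sup != "unchanged" and sup == inv:
        return ("TCC related", sup)                 # altered in both tumor stages
    if sup != "unchanged" and inv == "unchanged":
        return ("Bladder papilloma related", sup)   # altered in superficial only
    if sup == "unchanged" and inv != "unchanged":
        return ("Invasive TCC related", inv)        # altered in invasive only
    return ("other", None)


# Hypothetical scaled intensities: (normal, superficial, invasive).
examples = {
    "gene_a": progression_group(100.0, 600.0, 700.0),
    "gene_b": progression_group(500.0, 50.0, 450.0),
    "gene_c": progression_group(100.0, 120.0, 900.0),
}
```

Each group splits into an increased and a decreased subgroup, which corresponds to the paired panels of Fig. 6.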

EXAMPLE 5

Cluster Analysis 

The level of a gene transcript during disease progression can be thought
of as a pattern that can be correlated to patterns of other gene transcripts. If
the expression of one gene is very similar to the expression of another gene in
several samples, they are a correlated pair of genes. This pair of genes can then
be correlated to other genes with a similar transcriptional behavior in the set of
tissues examined, and together these constitute a gene cluster. In the next step
the relation between clusters is established and a dendrogram of genes is
formed, in which strongly correlating gene clusters are near each other. The
principles are described in Eisen et al., Proc. Natl. Acad. Sci. USA 95, 14863
(1998). Briefly, each gene vector was placed in its own cluster, where the
cluster prototype was set to the gene vector. All pair-wise vector angles
between cluster prototypes were calculated. The smallest vector angle was
identified, and those clusters were merged as a weighted average of the two
prototypes (and also a weighted average of all the gene vectors each prototype
represented). The vector angles were then updated between the newly merged
clusters and the merger process was repeated. The final clusters are displayed
in the order in which they were merged.
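
The merging procedure outlined above can be sketched as a small agglomerative clustering routine. This is an illustration under assumptions, not the program actually used: the prototypes are weighted by the number of gene vectors in each cluster (one way to obtain an average of all the gene vectors a prototype represents), and all names and example vectors are hypothetical.

```python
import math


def vector_angle(x, y):
    """Angle in radians between two expression vectors."""
    dot = sum(a * b for a, b in zip(x, y))
    norm = math.sqrt(sum(a * a for a in x)) * math.sqrt(sum(b * b for b in y))
    if norm == 0:
        raise ValueError("zero-length vector has no defined angle")
    return math.acos(max(-1.0, min(1.0, dot / norm)))


def cluster_by_vector_angle(vectors):
    """Repeatedly merge the two clusters whose prototypes span the smallest angle.

    Returns the merges in order as (members_a, members_b, angle).
    """
    clusters = [{"members": [i], "prototype": list(v)} for i, v in enumerate(vectors)]
    merges = []
    while len(clusters) > 1:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                angle = vector_angle(clusters[i]["prototype"], clusters[j]["prototype"])
                if best is None or angle < best[0]:
                    best = (angle, i, j)
        angle, i, j = best
        a, b = clusters[i], clusters[j]
        na, nb = len(a["members"]), len(b["members"])
        # Size-weighted average of the two prototypes (assumed weighting).
        prototype = [(na * p + nb * q) / (na + nb)
                     for p, q in zip(a["prototype"], b["prototype"])]
        merges.append((sorted(a["members"]), sorted(b["members"]), angle))
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append({"members": a["members"] + b["members"], "prototype": prototype})
    return merges


# Hypothetical gene vectors (rows) across two samples.
genes = [[1.0, 0.0], [1.0, 0.1], [0.0, 1.0], [0.2, 1.0]]
merges = cluster_by_vector_angle(genes)
```

For the toy input, genes 0 and 1 merge first, then genes 2 and 3, and finally the two resulting clusters.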

Exactly the same procedure used to cluster genes can be used to cluster
the tissue samples, showing the relation between the different tissues based on
their gene expression. We based clustering analysis on either the 4067
transcripts scored as present in at least one of the samples, or on
those 400 transcripts (see Table 9) that covaried best with a weighting scheme
adding increasing values to increasing stages.

The scaled AvgDif measures as calculated by the Affymetrix software
were extracted for the normal pool and each of the graded tissues. Only the
4067 genes with an AbsCall of P (present) in at least one of the tissues were
considered. All AvgDif measures below 20 were set to 20. For each tissue
and each gene, the AvgDif from the normal pool was either subtracted, to
define the "absolute difference," or used as the divisor, with the natural
logarithm applied to the ratio, to
define the "log-fold" relative measure. The relative expression measures for
each tissue (log-fold or absolute difference) were used to cluster tissues by a
hierarchical method using the Euclidean distance between tissues. Tissue
dendrograms were constructed with the PHYLIP program using clustering
order and distances. A weighting scheme (see Example 6) for the seven
observed stages and grades of cancer was used to select 200 positively
covarying and 200 negatively covarying genes with respect to progression.
The same hierarchical method and a normalized Euclidean distance (vector
angle) were used to cluster the top 400 positively and negatively covarying
genes for both relative expression measures. Gene dendrograms were
constructed by the same method as for the tissue dendrograms.
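
The two relative expression measures and the distance between tissues can be written out as follows. This is a minimal sketch: the floor of 20 units, the subtraction, the natural logarithm of the ratio and the Euclidean distance follow the text, while the function names and the example AvgDif values are hypothetical.

```python
import math

FLOOR = 20.0  # AvgDif values below 20 are set to 20, as stated in the text


def floored(value):
    return max(value, FLOOR)


def absolute_difference(tissue, normal):
    """Per-gene AvgDif of the tissue minus AvgDif of the normal pool."""
    return [floored(t) - floored(n) for t, n in zip(tissue, normal)]


def log_fold(tissue, normal):
    """Per-gene natural logarithm of tissue AvgDif divided by normal AvgDif."""
    return [math.log(floored(t) / floored(n)) for t, n in zip(tissue, normal)]


def euclidean_distance(x, y):
    """Distance between two tissues in relative-expression space."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))


# Hypothetical AvgDif values for three genes.
normal_pool = [100.0, 10.0, 400.0]
tumor_a = [200.0, 5.0, 100.0]
tumor_b = [100.0, 10.0, 400.0]

abs_a = absolute_difference(tumor_a, normal_pool)
log_a = log_fold(tumor_a, normal_pool)
distance = euclidean_distance(log_a, log_fold(tumor_b, normal_pool))
```

The second gene shows why the floor matters: both values lie below 20 units, so neither measure registers a change for it.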
Tissue clusters 

Different algorithms based on either fold change or absolute differences
in transcript levels across the different samples were applied to all transcripts
or only those covarying with a progression scale. Both methods were able to
cluster the tissues according to the stage and grade of atypia of the tumors or
tumor pools in a meaningful way (Fig. 5). The two noninvasive and the two invasive
pools each clustered very closely together both using the fold change and the
absolute difference, indicating a close genetic relation between these, and
indicating that one effect of pooling samples is a reduction of the variation in
gene expression. The single tumor preparations showed a more varied
distribution but still reflected the stage of the tumor. In the log-fold
dendrograms (Fig. 5A, C) the superficial tumors 335, 837 and 901 cluster close
to the superficial pools, but the pTa grade III tumor 901 seems closer to the
superficial pools than the pTa grade II tumor 837. This may either be due to
the variation in histopathological grading or due to the tumors having different
genetic properties. The minimally invasive pT1 grade III tumor 320 is
correctly placed in between the muscle invasive and the superficial tumors, and
the muscle invasive tumor 713 is placed very close to the pools of pT2+
tumors. Tumor 713 seemed to be closer to the pT2+ grade IV than the grade III
pool although it was histopathologically scored as grade III. In the absolute
difference dendrogram (Fig. 5B, D) the superficial tumors 837 and 901 are
closely related to the superficial pools, the pT1 superficially invasive tumor is
less related, and finally the invasive tumor 713 is located closest to the invasive
pools. An exception was the superficial pTa grade I tumor 335-6, which deviated
from all other tumors. Whether this tumor has unique properties is unknown;
however, it did not deviate from the expected location in the dendrograms
based on fold change.

The dendrograms show that the clustering algorithms work very well,
that the dataset obtained from the oligonucleotide arrays reflects the biological
properties of the tumors, and that objective information on a tumor's stage and
grade can be obtained from mathematical analysis of gene expression data.

Furthermore, it is seen that when ranking based on covariance to the
progression is used to extract the top 10% covarying genes, these have a
dendrogram that is almost identical to the one based on 4067 genes. We
therefore used the ranking procedure when analyzing gene clusters.

Gene clusters

The data obtained from cluster analysis are presented as colored images
in which genes with similar expression patterns are clustered next to each other
on the vertical axis and the samples according to stage and grade on the
horizontal axis (Fig. 7). The color of each cell in the tabular image represents
the ratio between the sample expression of the gene in question and the
expression in normal urothelium. The color saturation is directly proportional
to the magnitude of the measured expression ratio, cyan indicating the lowest
ratio, yellow indicating the highest ratio. Black indicates a ratio of one, a
similar level of expression in tumor as in normal urothelium. The two different
clustering methods, log-fold and absolute difference, gave completely different
clusters across the set of samples (Fig. 7).

In the log-fold based cluster analysis, the top 200 positively covarying
genes can be divided into five different clusters containing functionally related
genes (Fig. 7, left upper column). The cluster shown at the top contains genes
related to cell proliferation such as cyclins A and E, PCTAIRE-1, and
SWI/SNF. The next cluster mainly contains oncogenes and growth factors.
Genes in both these clusters are expressed at a level close to that seen in
normal urothelium in superficial tumors (black) and increase during disease
progression (yellow). The two clusters at the lower part show a reduced
expression level in the superficial tumors compared to normal (cyan) and then
an increase above the normal urothelial level in invasive tumors (shades of
yellow). These clusters contain a set of immunologically related genes, like
different MHCs and immunoglobulins, cancer related genes like src-like kinase
and Fas/Apo-1, and finally another immunologically related cluster at the
bottom.

The 200 negatively covarying genes (Fig. 7, left lower column) could
be divided into three different clusters based on log-fold change and function
of the genes. The upper cluster contains genes related to cell adhesion like
laminins, integrins and P-cadherin (Fig. 7, left lower column). They all show
a reduced level of expression in the invasive tumors as evidenced by the cyan
coloring to the right. The small middle cluster contains four genes related to
transcription, and finally the lowest cluster in the figure contains five
proteinases, like cathepsin E (two different probe sets for the same gene) and
metalloproteinase, as well as a protease inhibitor. The lower clusters are
characterized by an increase in level in superficial tumors (yellow) followed by
a reduction to a level below normal urothelium in invasive tumors.

In the absolute difference based cluster analysis, the top 200 covarying
genes that showed a positive covariance contained only a few clusters having a
functional relation. The upper cluster (Fig. 7, right upper column) contained
five genes related to cell proliferation like the microtubule-associated protein 
and oncoprotein 18/stathmin. The next cluster was a set of immunology related 
genes like MHC and LERK-2. Both these clusters showed an increased 
expression level in invasive tumors compared to normal urothelium. The cluster 
at the lower end of the figure showed a reduced level in superficial tumors and 
a return to normal or increased level in invasive tumors. This cluster contained 
many immunology-related genes like MHC, HLA and immunoglobulin genes. 
Finally, for genes that showed a negative covariance based on absolute 
difference (Fig. 7, right lower column), this was mainly due to clustering of 
ribosomal genes. A very tight cluster in the middle of the graph shows
ribosomal genes that are upregulated in expression in superficial tumors and
downregulated or unaltered in invasive tumors. The middle ribosomal cluster
is generally expressed at a lower level than in normal urothelium, whereas the
cluster at the bottom of the figure is similar to the one in the middle. Other
genes that seemed to cluster were a small tight cluster of immunology related
genes, and two tumor inhibitors, TGF-beta superfamily protein and Sui1, in the
uppermost cluster.

Thus, a pattern of altered gene transcription occurs during the 
progression of bladder cancer that involves a number of genes belonging to 
functionally different gene families. Cluster analysis identified many biologically 
relevant genes, and in that aspect was superior to the probe-by-probe 
comparison described above.

where $\mathrm{dp}(X,Y) = \sum_i X_i Y_i$

and $\mathrm{len}(X) = \sqrt{\mathrm{dp}(X,X)}$.



After the sample had been compared in this way to each pool, invasive and
non-invasive, the pool which differed from the sample by the smaller angle was
determined to be the class of the sample.
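
The angle-based assignment can be sketched as a nearest-pool classifier. The sketch assumes that dp denotes the dot product and len the vector length, and that the sample and the pools are represented by expression vectors over the same genes. The function names and the example vectors are hypothetical.

```python
import math


def dp(x, y):
    """Dot product of two expression vectors."""
    return sum(a * b for a, b in zip(x, y))


def length(x):
    """Vector length, the square root of dp(x, x)."""
    return math.sqrt(dp(x, x))


def angle(x, y):
    """Angle in radians between two expression vectors."""
    cosine = dp(x, y) / (length(x) * length(y))
    return math.acos(max(-1.0, min(1.0, cosine)))


def classify(sample, pools):
    """Return the name of the pool that differs from the sample by the smallest angle."""
    return min(pools, key=lambda name: angle(sample, pools[name]))


# Hypothetical relative expression vectors over three genes.
pools = {
    "non-invasive": [1.0, 0.2, 0.1],
    "invasive": [0.1, 0.3, 1.0],
}
call_1 = classify([0.8, 0.3, 0.2], pools)
call_2 = classify([0.2, 0.2, 0.9], pools)
```

Because the angle ignores vector length, a sample with uniformly stronger or weaker signal than a pool is still assigned to that pool.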

Samples from ten bladder tumors were assigned by the classifier, and
the results are shown in Fig. 8. The classifier was able to correctly call the
presence or absence of muscle invasion in all ten samples examined (7
non-muscle invasive, 3 muscle invasive), based on categorizations made by a
pathologist. To cross-validate the classification methodology, two non-invasive
(Ta) and two invasive (T2) pools were compared with tumors from
each of 10 patients. Since two pools were available for each stage, four
possible combinations (classifier sets) of one Ta pool and one T2 pool were
tested. The classification rate was calculated as the number of tumors correctly
identified times 10. The results for each of the four classifier sets were
averaged to obtain the data shown in Fig. 8.
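
The cross-validation over the four classifier sets can be sketched as follows. With ten tumors, the number correctly identified times 10 is the percentage correct, so the sketch computes a percentage for any number of samples. The classifier is the smallest-angle rule described above; the function names and all vectors and labels are hypothetical toy data.

```python
import math
from itertools import product


def angle(x, y):
    """Angle in radians between two expression vectors."""
    dot = sum(a * b for a, b in zip(x, y))
    norm = math.sqrt(sum(a * a for a in x)) * math.sqrt(sum(b * b for b in y))
    return math.acos(max(-1.0, min(1.0, dot / norm)))


def classify(sample, ta_pool, t2_pool):
    """Assign the class of the pool that differs from the sample by the smaller angle."""
    return "Ta" if angle(sample, ta_pool) <= angle(sample, t2_pool) else "T2"


def mean_classification_rate(samples, labels, ta_pools, t2_pools):
    """Percentage correct for every (Ta pool, T2 pool) pair, and the mean over pairs."""
    rates = []
    for ta_pool, t2_pool in product(ta_pools, t2_pools):
        correct = sum(classify(s, ta_pool, t2_pool) == label
                      for s, label in zip(samples, labels))
        rates.append(100.0 * correct / len(samples))
    return sum(rates) / len(rates), rates


# Hypothetical toy data: two pools per stage give four classifier sets.
ta_pools = [[1.0, 0.1], [0.9, 0.2]]
t2_pools = [[0.1, 1.0], [0.2, 0.9]]
samples = [[1.0, 0.3], [0.2, 1.0], [0.7, 0.6]]
labels = ["Ta", "T2", "T2"]
mean_rate, rates = mean_classification_rate(samples, labels, ta_pools, t2_pools)
```

In the toy data the third sample is misclassified by every classifier set, so each set scores two out of three.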

EXAMPLE 7 

Confirmation of microarray expression analysis by Northern blotting 

In order to confirm the array data, Northern blotting was performed on
the same samples of RNA as used for array hybridization. A standardized
amount of RNA was run in each lane, followed by blotting with a labelled RNA
probe, and quantitation of the band obtained (Fig. 9).

Total RNA, 0.5-4 µg per lane, was separated in 1.5% agarose-formaldehyde
gels, transferred onto Zeta-Probe® nylon membrane (Bio-Rad)
by positive pressure (Posiblotter, Stratagene) and immobilized by baking for
20 min at 120°C. The filters were hybridized with digoxigenin-labelled (DIG)
RNA transcribed from 600-1000 bp PCR products containing a T7 promoter
incorporated via the antisense primers. Filters were hybridized with 10 ng
probe per ml of ultrahyb™ hybridization solution at 68°C for 16 h and washed
to a stringency of 0.1x SSC at 68°C. Specific hybridization was detected by
reacting the membrane with monoclonal anti-DIG antibodies conjugated with
alkaline phosphatase, incubating with ECF chemifluorescence substrate
(Amersham Pharmacia) and scanning on a Storm 840 (Molecular Dynamics).
The hybridization signals were quantified with ImageQuant 5.0 software.

As can be seen from the plots, the oligonucleotide array and the 
Northern blot gave similar results with the different probes, both in genes 
expressed at a high level (beta2-microglobulin), and those expressed at a very 
low level (CD59). 

EXAMPLE 8 

Immunohistochemical localization of expressed proteins

The biopsy samples used to study gene expression in bladder tumors 
contain cells other than urothelial cells, although the amount of other cells 
should be limited due to the use of single cell solutions. We therefore used 
immunostaining of tissue sections from the single tumors examined to 
determine which cells expressed the protein encoded by the transcript in 
question. We used the transcript levels to select a group of proteins supposed 
to show variation from sample to sample, making possible a rough correlation 
between level of protein detected and intensity of the transcript on the 
microarray. 

Four µm sections were cut from paraffin-embedded tissue blocks,
mounted, and deparaffinized by incubation at 80°C for 10 min, followed by
immersion in heated oil at 60°C for 10 min (Estisol 312, Estichem A/S,
Denmark) and rehydration. Antigen retrieval was achieved in TEG
(Tris-EDTA-Glycerol) buffer using microwaves at 900 W. The tissue sections
were cooled in the buffer for 15 min before a brief rinse in tap water. Endogenous
peroxidase activity was blocked by incubating the sections with 1% H2O2 for
20 min, followed by three rinses in tap water, 1 min each. The sections were
then soaked in PBS buffer for 2 min. The next steps were modified from the
descriptions given by Oncogene Science Inc., in the Mouse
Immunohistochemistry Detection System, XHCOl (UniTect, Uniondale, NY,
USA). Briefly, the tissue sections were incubated overnight at 4°C with
primary antibody (against beta-2 microglobulin (Dako), cytokeratin 8,
cystatin-C (both from Europa, US), junB, CD59, E-cadherin, apo-E, cathepsin
E, vimentin, IGFII (all from Santa Cruz)), followed by three rinses in PBS
buffer for 5 min each. Afterwards, the sections were incubated with
biotinylated secondary antibody for 30 min, rinsed three times with PBS buffer
and subsequently incubated with ABC (avidin-biotinylated horseradish
peroxidase complex) for 30 min, followed by three rinses in PBS buffer.
Staining was performed by incubation with AEC (3-amino-9-ethylcarbazole) for
10 min. The tissue sections were counterstained with Mayer's hematoxylin,
washed in tap water for 5 min, and mounted with glycerol-gelatin. Positive and
negative controls were included in each staining round with all antibodies.

We found several of the proteins to be expressed not only by urothelial
cells but also by leukocytes, endothelial cells or histiocytes (Table 10, Fig. 10).
Of the examined proteins only keratin 8 and ApoE were confined to
urothelium; the other proteins were also present in other cell types. Based on
the assumption that transcript and protein originate from the same cell, this
clearly indicates that conclusions on the origin of the transcripts require a
histological examination, or other verification procedure. The amount of
stroma in a biopsy, the vascularization (amount of endothelial cells), the level
of leukocyte infiltration, and the grade of atypia of the urothelial cells were all
parameters that seemed to influence the level of a given transcript.

The level of protein identified by immunostaining, disregarding the cell
type expressing the protein, correlated well with the transcript level measured
on the microarray (Fig. 10). However, no attempt was made to quantitate the
immunostaining due to the often large heterogeneity in staining across the
sections.

Gene name                           covariance

TGF-beta superfamily protein, av dif neg 

Qip1, :karyopherin alpha 4 av dif neg 

hunc18b2, av dif neg 

26S proteasome subunit p44.5, av dif neg 

galectin-9 isoform, av dif neg 

Chromosome 16 BAC clone CIT987 av dif neg

PAC clone DJ515N1 from 22q11.2 av dif neg

DNA from overlapping chromosom av dif neg 

DNA from overlapping chromosom av dif neg

DNA from chromosome 19p13.2 co av dif neg 

uroplakin II mRNA, av dif neg 

DNA mismatch repair protein \( av dif neg 

Luman mRNA, :Luman "mRNA," / av dif neg 

SKB1 Hs mRNA, :skb1 \(S. pombe av dif neg 

lipocortin II, :annexin II av dif neg 

fetal liver cytochrome P-450 av dif neg 

gene for thymidylate synthase, av dif neg 

gene for enteric smooth muscle av dif neg 

2-oxoglutarate dehydrogenase, av dif neg 

interleukin 2 receptor gamma av dif neg 

interleukin 2 receptor gamma av dif neg 

MSS1, : proteasome \(prosome, av dif neg 

protein-tyrosine phosphatase, av dif neg 

P1 ATP synthase subunit c, : av dif neg 

tumor-associated 120 kDa nucl av dif neg 

KIAA0018gene, :KIAA0018gen av dif neg 

fatty acids omega-hydroxylase av dif neg 

eukaryotic initiation factor av dif neg 

MGC-24, :"MGC-24," complete av dif neg

homolog of yeast ribosomal pro av dif neg 

ATP synthase alpha subunit, av dif neg 

mitochondrial 3-oxoacyl-CoA t av dif neg 

ATP synthase gamma-subunit \( av dif neg 

8-oxo-dGTPase, : "8-oxo-dGTP av dif neg 

PACAP receptor, adenylate c av dif neg 

PACAP receptor, adenylate c av dif neg 

precursor of P100 serine prot av dif neg

KIAA0030 gene, av dif neg 

KIAA0120 gene, :transgelin 2 av dif neg

ribosomal protein, : ribosom av dif neg 

KIAA0112gene, av dif neg 

KIAA0112gene, av dif neg 

mRNA, clone:RES4-4. av dif neg 

mRNA, clone:RES4-4. av dif neg 

KIAA0036 gene, :KIAA0036gen av dif neg 

integrin alpha subunit, av dif neg 

ribonuclease A \(RNase A\), av dif neg

RNAhelicase, :DEAD/H \(Asp- av dif neg 

gene for dihydrolipoamide succ av dif neg 

proteasome subunit HsC7-I, : av dif neg

ATP synthase B chain, 5'UTR av dif neg 

mRNA \(KIAA00167\), partial se av dif neg 

gene for hepatitis C-associate av dif neg 

proteasome subunit Y, : prot av dif neg 
D29641_at HUMORFA02 KIAA0052 gene, av dif neg
D29958_at HUMORFA10 KIAA0116 gene, av dif neg
D30655_at HUMELF4AII eukaryotic initiation factor av dif neg
D31764_at HUMORFKG1C KIAA0064 gene, :KIAA0064 gen av dif neg
D31764_at HUMORFKG1C KIAA0064 gene, :KIAA0064 gen av dif neg
D31883_at HUMORFKG1L KIAA0059 gene, :KIAA0059"g av dif neg
D31884_at HUMORFKG1M KIAA0063 gene, :KIAA0063 gen av dif neg
D31884_at HUMORFKG1M KIAA0063 gene, :KIAA0063 gen av dif neg
D31891_at HUMORFKG1T KIAA0067 gene, :SET domain, av dif neg
D32129_f_at HUMHLAAD HLA class-I \(HLA-A26\) heavy av dif neg
D38047_at HUMPSP31 26S proteasome subunit p31, av dif neg
D38555_at HUMORF008 KIAA0079 gene, :Sec24p, S. C av dif neg
D38583_at HUMCOLO calgizzarin, :"calgizzarin, av dif neg
D42046_at HUMKIAAJ KIAA0083 gene, :DNA2 \(DNA r av dif neg
D42046_at HUMKIAAJ KIAA0083 gene, :DNA2 \(DNA r av dif neg
D42047_at HUMKIAAK KIAA0089 gene, :KIAA0089"g av dif neg
D43682_s_at HUMVLCAD very-long-chain acyl-CoA dehy av dif neg
D45370_at HUMUPST1 apM2 GS2374 \(unknown product av dif neg
D49396_at HUMAOP1 Apo1J(MER5\(Aop1-Mouse\)-lik av dif neg
D49488_at HUMHTTP alpha-tocopherol transfer pro av dif neg
D49728_at HUMNAK1 NAK1 DNA binding protein, av dif neg
D49824_s_at HUMHLABAA HLA-B null allele mRNA. :HLA-B av dif neg
D50640_at D50625S16 DNA for phosphodiesterase 3B, av dif neg
D63478_at KIAA0144 KIAA0144 gene, :KIAA0144 gen av dif neg
D63479_s_at D63479 KIAA0145 gene, av dif neg
D63485_at KIAA0151 KIAA0151 gene, :KIAA0151 gen av dif neg
D63486_at KIAA0152 KIAA0152 gene, :KIAA0152 gen av dif neg
D63851_at D63851 unc-18 homologue, av dif neg
D78129_at HUMHL1115B squalene epoxidase, av dif neg
D78275_at PSMC6 proteasome subunit p42, :pro av dif neg
D79205_at D79205 ribosomal protein L39, av dif neg
D79984_s_at D79984 KIAA0162 gene, av dif neg
D79984_s_at D79984 KIAA0162 gene, av dif neg
D80002_at D80002 KIAA0180 gene, av dif neg
D82345_at D82345 NB thymosin beta, av dif neg
D86425_at D86425 osteonidogen, av dif neg
D86974_at D86974 KIAA0220 gene, :"KIAA0220" av dif neg
D86985_at KIAA0232 KIAA0232 gene, :KIAA0232 gen av dif neg
D87258_at D87258 serin protease with IGF-bindi av dif neg
D87735_at RPL14 ribosomal protein L14, :ribo av dif neg
D87953_at D87953 RTP, :"RTP," complete cds av dif neg
D89052_at ATP6F proton-ATPase-like protein, av dif neg
HG1034-HT1034_f_at <empty> <empty> av dif neg
HG1400-HT1400_s_at <empty> <empty> av dif neg
HG1428-HT1428_s_at HG1428-HT1428 Globin,"" Beta" av dif neg
HG1428-HT1428_s_at HG1428-HT1428 Globin,"" Beta" av dif neg
HG1515-HT1515_f_at HG1515-HT1515 Transcription Factor Btf3b av dif neg
HG1515-HT1515_f_at HG1515-HT1515 Transcription Factor Btf3b av dif neg
HG1614-HT1614_at HG1614-HT1614 Protein Phosphatase "1 ," Alp av dif neg
HG1614-HT1614_at HG1614-HT1614 Protein Phosphatase "1 ," Alp av dif neg
HG1800-HT1823_at <empty> <empty> av dif neg
HG1872-HT1907_at <empty> <empty> av dif neg
HG1872-HT1907_at <empty> <empty> av dif neg
HG1980-HT2023_at <empty> <empty> av dif neg
HG2147-HT2217_r_at <empty> <empty> av dif neg
HG2147-HT2217_r_at <empty> <empty> av dif neg
HG2149-HT2219_at <empty> <empty> av dif neg
HG2167-HT2237_at <empty> <empty> av dif neg
HG2197-HT2267_s_at HG2197-HT2267 :"Collagen,"" Type ""Vii,"" av dif neg
HG2238-HT2321_s_at HG2238-HT2321 :"Nuclear Mitotic Apparatus P av dif neg
HG2239-HT2324_r_at <empty> <empty> av dif neg
HG2239-HT2324_r_at <empty> <empty> av dif neg
HG2264-HT2360_at <empty> <empty> av dif neg
HG2279-HT2375_at HG2279-HT2375 :Triosephosphate Isomerase av dif neg
HG2566-HT4867_at HG2566-HT4867 :Microtubule-Associated Prote av dif neg
HG2788-HT2896_at HG2788-HT2896 :Calcyclin av dif neg
HG2815-HT2931_at <empty> <empty> av dif neg
HG2815-HT2931_s_at <empty> <empty> av dif neg
HG2815-HT4023_s_at <empty> <empty> av dif neg
HG2873-HT3017_at <empty> <empty> av dif neg
HG2917-HT3061_f_at HG2917-HT3061 :"Major Histocompatibility av dif neg
HG2917-HT3061_f_at HG2917-HT3061 :"Major Histocompatibility av dif neg
HG2981-HT3127_s_at <empty> <empty> av dif neg
HG2994-HT4850_s_at <empty> <empty> av dif neg
HG3039-HT3200_at <empty> <empty> av dif neg
HG3076-HT3238_s_at HG3076-HT3238 :"Heterogeneous Nuclear Ribon av dif neg
HG3107-HT3283_s_at <empty> <empty> av dif neg
HG3107-HT3283_s_at <empty> <empty> av dif neg
HG311-HT311_at <empty> <empty> av dif neg
HG3214-HT3391_at <empty> <empty> av dif neg
HG3236-HT3413_f_at <empty> <empty> av dif neg
HG3254-HT3431_at <empty> <empty> av dif neg
HG3342-HT3519_s_at HG3342-HT3519 :Id1 av dif neg
HG3364-HT3541_at HG3364-HT3541 :Ribosomal Protein L37 av dif neg
HG33-HT33_at HG33-HT33 <empty> av dif neg
HG3484-HT3678_s_at <empty> <empty> av dif neg
HG3514-HT3708_at HG3514-HT3708 :"Tropomyosin Tm30nm," Cytosk av dif neg
HG3543-HT3739_at HG3543-HT3739 :Insulin-Like Growth Factor 2 av dif neg
HG3543-HT3739_at HG3543-HT3739 :Insulin-Like Growth Factor 2 av dif neg
HG3549-HT3751_at HG3549-HT3751 :Wilm'S Tumor-Related Protein av dif neg
HG3570-HT3773_at HG3570-HT3773 Protein Phosphatase Inhibito av dif neg
HG3576-HT3779_f_at <empty> <empty> av dif neg
HG3576-HT3779_f_at <empty> <empty> av dif neg
HG3731-HT4001_at <empty> <empty> av dif neg
HG384-HT384_at HG384-HT384 :Ribosomal Protein L26 av dif neg
HG384-HT384_at HG384-HT384 :Ribosomal Protein L26 av dif neg
HG3945-HT4215_at <empty> <empty> av dif neg
HG3991-HT4281_r_at <empty> <empty> av dif neg
HG4020-HT4290_s_at HG4020-HT4290 :Transglutaminase av dif neg
HG4258-HT4528_at <empty> <empty> av dif neg
HG4319-HT4589_at HG4319-HT4589 :Ribosomal Protein L5 av dif neg
HG4336-HT4606_at <empty> <empty> av dif neg
HG4533-HT4938_at <empty> <empty> av dif neg
HG4542-HT4947_at HG4542-HT4947 :Ribosomal Protein L10 av dif neg
HG4557-HT4962_r_at <empty> <empty> av dif neg
HG4668-HT5083_s_at <empty> <empty> av dif neg
HG4668-HT5083_s_at <empty> <empty> av dif neg
HG474&.HT5197_at <empty> <empty> av dif neg
HG613-HT613_at HG613-HT613 :Ribosomal Protein S12 av dif neg
HG613-HT613_at HG613-HT613 :Ribosomal Protein S12 av dif neg
HG821-HT821_at <empty> <empty> av dif neg
HG880-HT880_at <empty> <empty> av dif neg
HG880-HT880_at <empty> <empty> av dif neg
HG987-HT987_at HG987-HT987 :Mac25 :Mac25 av dif neg
J00105_s_at HSMGLO messenger RNA fragment for the av dif neg
J02611_at HUMAPOD apolipoprotein D mRNA, :apoli av dif neg
J02611_at HUMAPOD apolipoprotein D mRNA, :apoli av dif neg
J02683_s_at HUMATPC ADP/ATP carrier protein mRNA, av dif neg
J02783_at HUMTHBP thyroid hormone binding protei av dif neg
J02874_at HUMALBP adipocyte lipid-binding protei av dif neg
J02874_at HUMALBP adipocyte lipid-binding protei av dif neg
J02902_at HUMP2A protein phosphatase 2A regulat av dif neg
J02906_at HUMCYPIIF cytochrome P450IIF1 protein \( av dif neg
J03077_s_at HUMGLBA co-beta glucosidase \(proactiv av dif neg
J03242_s_at HUMGFIL2 insulin-like growth factor II m av dif neg
J03242_s_at HUMGFIL2 insulin-like growth factor II m av dif neg
J03592_at HUMTLCA ADP/ATP translocase mRNA, 3' e av dif neg
J03756_at HUMGHVA growth hormone-variant \(GH1\) av dif neg
J03801_f_at HUMLS2 lysozyme mRNA, complete cds wi av dif neg
J03909_at HUMIIP gamma-interferon-inducible pro av dif neg
J03909_at HUMIIP gamma-interferon-inducible pro av dif neg
J03934_s_at HUMNMOR Human, NAD\(P\)H:menadione oxi av dif neg
J04093_s_at HUMUGT1FA phenol UDP-glucuronosyltransfe av dif neg
J04093_s_at HUMUGT1FA phenol UDP-glucuronosyltransfe av dif neg
J04152_rna1_s_at HUMGA733A gastrointestinal tumor-associa av dif neg
J04164_at HUM927A interferon-inducible protein 9 av dif neg
J04164_at HUM927A interferon-inducible protein 9 av dif neg
J04173_at HUMPGAM phosphoglycerate mutase \(PGAM av dif neg
J04611_at HUMANP70 lupus p70 \(Ku\) autoantigen p av dif neg
J04615_at HUMSNRAA lupus autoantigen \(small nucl av dif neg
J04617_s_at HUMEF1A elongation factor EF-1-alpha g av dif neg
J04973_at HUMCOR2M cytochrome bc-1 complex core p av dif neg
J05038_s_at HUMCTSE cathepsin E mRNA, :cathepsin av dif neg
J05038_s_at HUMCTSE cathepsin E mRNA, :cathepsin av dif neg
J05272_at HUMIMPH IMP dehydrogenase type 1 mRNA av dif neg
K02405_f_at HUMMHDC3B MHC class II HLA-DC-3-beta gen av dif neg
K03189_f_at HUMCGBEL03 chorionic gonadotropin beta su av dif pos
K03430_at HUMC1QB2 complement C1q B-chain gene, e av dif pos
K03430_at HUMC1QB2 complement C1q B-chain gene, e av dif pos
L00634_s_at HUMFPTA farnesyl-protein transferase a av dif pos
L02328_f_at HUMPREBLYM \(clone Hu lambda-17\) lambda- av dif pos
L04270_at HUMTNFRRP \(clone CD18\) tumor necrosis av dif pos
L04483_s_at HUMRPS21X ribosomal protein S21 \(RPS21 av dif pos
L04490_at HUMNADH \(clone CC8\) NADH-ubiquinone av dif pos
L05072_Aat HUMIFNRF1A interferon regulatory factor 1 av dif pos
L05188_f_at HUMSPRR2B small proline-rich protein 2 av dif pos
L06499_at HUMRPL37A ribosomal protein L37a \(RPL37 av dif pos
L06505_at HUML12A ribosomal protein L12 mRNA, : av dif pos
L06797_s_at HUMGPCR \(clone L5\) orphan G protein- av dif pos
L06797_s_at HUMGPCR \(clone L5\) orphan G protein- av dif pos
L07044_at HUMCCDPKB calcium/calmodulin-dependent p av dif pos
L08666_at HUMPORIN porin \(por\) mRNA, complete c av dif pos
L09209_s_at HUMAMYLOID amyloid protein homologue mRNA av dif pos
L10413_at HUMFTA farnesyltransferase alpha-subu av dif pos
L11566_at HUMRPL18A ribosomal protein L18 \(RPL18 av dif pos
L11672_at HUMKRUPZN Kruppel related zinc finger pr av dif pos
L11708_at HUMB17HSD 17 beta hydroxysteroid dehydro av dif pos
L11708_at HUMB17HSD 17 beta hydroxysteroid dehydro av dif pos
L12711_s_at HUMTRANSKE transketolase \(tk\) mRNA, :" av dif pos
L12711_s_at HUMTRANSKE transketolase \(tk\) mRNA, :" av dif pos
L19493_at HUMFMR1R FMR1 gene, 3'end. av dif pos
L19527_at HUMRPL27 ribosomal protein L27 \(RPL27 av dif pos
L19686_rna1_at HUMMIF macrophage migration inhibitor av dif pos
L19686_rna1_at HUMMIF macrophage migration inhibitor av dif pos
L19779_at HUMH2A2A histone H2A.2 mRNA, :H2Ahist av dif pos
L20688_at HUMLYGDI GDP-dissociation inhibitor pro av dif pos
L20941_at HUMFERRITH ferritin heavy chain mRNA, :f av dif pos
L21954_at HSPBR4 peripheral benzodiazepine rece av dif pos
L21954_at HSPBR4 peripheral benzodiazepine rece av dif pos
L26247_at HUMSUI1ISO sui1iso1 mRNA, av dif pos
L27943_at HUMCYDc cytidine deaminase \(CDA\) mRN av dif pos
L3286o_at HUMEPR1NP effector cell protease recepto av dif pos
L32978_at HUMMLK3A protein kinase \(MLK-3\) mRNA, av dif pos
L33075_at HUMIQGA ras GTPase-activating-like pro av dif pos
L33243_at HUMPKD1A polycystic kidney disease 1 pr av dif pos
L33842_rna1_at HUMIMPDH \(clone FFE-7\) type II inosin av dif pos
L33842_rna1_at HUMIMPDH \(clone FFE-7\) type II inosin av dif pos
L33930_s_at HUMCD24B CD24 signal transducer mRNA, c av dif pos
L37127_at HUMRP1A RNA polymerase II mRNA, :poly av dif pos
L38490_s_at HUMADPRF ADP-ribosylation factor mRNA, av dif pos
L38928_at HUMMETSYN 5,10-methenyltetrahydrofolate av dif pos
L38941_at HUMRPL34A ribosomal protein L34 \(RPL34 av dif pos
L39059_at HUMTFSL1A transcription factor SL1 mRNA, av dif pos
L40357_at HUMTRIP7M thyroid receptor interactor \( av dif pos
L40379_at HUMTRIP10M thyroid receptor interactor \( av dif pos
L40387_at HUMTRIP14G thyroid receptor interactor \( av dif pos
L40392_at HUMORFB \(clone S164\) mRNA, 3' end of av dif pos
L40904_at HUMPPARGB H. sapiens peroxisome prolifer av dif pos
L41870_at HUMRB1MRNA retinoblastoma susceptibility av dif pos
L42176_at HUMDRAL \(clone 35.3\) DRAL mRNA, :\( av dif pos
L42373_at HUMPP2A phosphatase 2A B56-alpha \(PP2 av dif pos
L42542_at HUMRIP1R RLIP76 protein mRNA, :RLIP76 av dif pos
L76159_at HUMFRG1R FRG1 mRNA, :FSHD region gene av dif pos
L76465_at HUMPGDHB NAD+-dependent 15 hydroxyprost av dif pos
L77888_at HUMPTPC protein tyrosine phosphatase m av dif pos
M10612_at HUMAPOCI1 apolipoprotein CII gene, av dif pos
M11119_at HUMERRNA endogenous retrovirus envelope av dif pos
M11147_at HUMFERL ferritin L chain mRNA, :ferri av dif pos
M11313_s_at HUMA2M alpha-2-macroglobulin mRNA, : av dif pos
M11353_at HUMHISH3C H3.3 histone class C mRNA, av dif pos
M12529_at HUMAPOE apolipoprotein E mRNA, :apoli av dif pos
M12529_at HUMAPOE apolipoprotein E mRNA, :apoli av dif pos
M12886_at HUMTCBYY T-cell receptor active beta-ch av dif pos
M12886_at HUMTCBYY T-cell receptor active beta-ch av dif pos
M13207_at HUMCSFGMA granulocyte-macrophage colony- av dif pos
M13560_s_at HUMIAIG6 Ia-associated invariant gamma- av dif pos
M13666_at HUMCMYBB c-myb mRNA, 3' end. av dif pos
M13755_at HUMIFN15K interferon-induced 17-kDa/15-k av dif pos
M13829_s_at HUMPKS putative raf related protein av dif pos
M13829_s_at HUMPKS putative raf related protein av dif pos
M13903_at HUMINV2 involucrin gene, exon 2. :invo av dif pos
M13929_s_at HUMMYCPOA c-myc-P64 mRNA, initiating fro av dif pos
M13929_s_at HUMMYCPOA c-myc-P64 mRNA, initiating fro av dif pos
M13934_cds2_at HUMRPS14 ribosomal protein S14 gene, : av dif pos
M13955_at HUMKERMII mesothelial keratin K7 \(type av dif pos
M14199_s_at HUMLAMR laminin receptor \(2H5 epitope av dif pos
M14199_s_at HUMLAMR laminin receptor \(2H5 epitope av dif pos
M14328_s_at HUMENOA alpha enolase mRNA, :enolase av dif pos
M14483_rna1_s_at HUMTHYMAA prothymosin alpha mRNA, av dif pos
M14676_at HUMSLK src-like kinase \(slk\) mRNA, av dif pos
M14678_at HUMSLK src-like kinase \(slk\) mRNA, av dif pos
M15395_at HUMLAP leukocyte adhesion protein \(L av dif pos
M15661_at HUMRPZH21 ribosomal protein mRNA, :ribo av dif pos
M15661_at HUMRPZH21 ribosomal protein mRNA, :ribo av dif pos
M16038_at HUMLYN lyn mRNA encoding a tyrosine k av dif pos
M17733_at HUMTHYB4 thymosin beta-4 mRNA, :thymos av dif pos
M17863_s_at HUMFFI2B preproinsulin-like growth fact av dif pos
M17863_s_at HUMFFI2B preproinsulin-like growth fact av dif pos
M17885_at HUMPPARP0 acidic ribosomal phosphoprotei av dif pos
M17886_at HUMPPARP1 acidic ribosomal phosphoprotei av dif pos
M18000_at HUMRPS17A ribosomal protein S17 gene, : av dif pos
M18737_rna1_at HUMHFSP Hanukah factor serine protease av dif pos
M19045_f_at HUMLSZH lysozyme mRNA, :"lysozyme ""m av dif pos
M19159_at HUMALPPD placental heat-stable alkaline av dif pos
M19159_at HUMALPPD placental heat-stable alkaline av dif pos
M19301_at HUMKAD branched-chain alpha-keto acid av dif pos
M19878_s_at HUMCALB01 calbindin 27 gene, exons 1 and av dif pos
M20902_at HUMAPOCIA apolipoprotein C-I \(VLDL\) ge av dif pos
M20902_at HUMAPOCIA apolipoprotein C-I \(VLDL\) ge av dif pos
M21142_cds2_s_at HUMGNAS6 guanine nucleotide-binding pro av dif pos
M21142_cds2_s_at HUMGNAS6 guanine nucleotide-binding pro av dif pos
M21186_at HUMNCBLCA neutrophil cytochrome b light av dif pos
M21186_at HUMNCBLCA neutrophil cytochrome b light av dif pos
M21302_at HUMSPR2B small proline rich protein \(s av dif pos
M21984_at HUMTRT \(clone PWHTnT16\) skeletal mu av dif pos
M22490_at HUMBMP2B bone morphogenetic protein-2B av dif pos
M22960_at HUMPPR protective protein mRNA. :pro av dif pos
M23178_s_at HUMG0S19A homologue-1 of gene encoding a av dif pos
M23613_at HUMNPM nucleophosmin mRNA, :nucleoph av dif pos
M24194_at HUMMHBA123 MHC protein homologous to chic av dif pos
M24194_at HUMMHBA123 MHC protein homologous to chic av dif pos
M24485_s_at HUMGSTP1G \(clone pHGST-pi\) glutathione av dif pos
M24486_s_at HUMPYHBASA prolyl 4-hydroxylase alpha sub av dif pos
M25079_s_at HUMBETGLA sickle cell beta-globin mRNA, av dif pos
M25079_s_at HUMBETGLA sickle cell beta-globin mRNA, av dif pos
M25280_at HUMLNHR lymph node homing receptor mRN av dif pos
M26311_s_at HUMCFA cystic fibrosis antigen mRNA, av dif pos
M26311_s_at HUMCFA cystic fibrosis antigen mRNA, av dif pos
M26665_at HUMHIS2X histatin 2 \(HIS2\) mRNA. :hi av dif pos
M26708_s_at HUMPTAA prothymosin alpha mRNA \(ProT- av dif pos
M26730_s_at HUMQBPC6 mitochondrial ubiquinone-bindi av dif pos
M27281_at HUMVPF vascular permeability factor m av dif pos
M27749_r_at HUMIGLR141 immunoglobulin-related 14.1 pr av dif pos
M27749_r_at HUMIGLR141 immunoglobulin-related 14.1 pr av dif pos
M27826_at HUMRTVLH3 endogenous retroviral protease av dif pos
M27891_at HUMCYS3A3 cystatin C \(CST3\) gene, exon av dif pos
M28212_at HUMRAB6A GTP-binding protein \(RAB6\) m av dif pos
M28882_s_at HUMMUC18B MUC18 glycoprotein mRNA, :mel av dif pos
M28882_s_at HUMMUC18B MUC18 glycoprotein mRNA, :mel av dif pos
M29335_at HUMMHDOA MHC class II DO-alpha mRNA, av dif pos
M29335_at HUMMHDOA MHC class II DO-alpha mRNA, av dif pos
M29610_s_at HUMGLYE glycophorin E mRNA, :glycopho av dif pos
M30818_at HUMMXB interferon-induced cellular re av dif pos
M30938_at HUMKUP Ku \(p70/p80\) subunit mRNA, av dif pos
M31303_rna1_at HUMOP18A oncoprotein 18 \(Op18\) gene, av dif pos
M31303_rna1_at HUMOP18A oncoprotein 18 \(Op18\) gene, av dif pos
M31520_at HUMRPS24A ribosomal protein S24 mRNA. av dif pos
M31520_rna1_s_at HUMRPS24A ribosomal protein S24 mRNA. :r av dif pos
M31627_at [illegible] X box binding protein-1 \(XBP- av dif pos
M31994_at [illegible] aldehyde dehydrogenase \(ALDH1 av dif pos
M32053_at [illegible] H19 RNA gene, av dif pos
M32304_s_at [illegible] metalloproteinase inhibitor mR av dif pos
M32405_at HUMRIGA homologue of rat insulinoma ge av dif pos
M32886_at HUMSRICPA sorcin CP-22 mRNA, :sorcin :s av dif pos
M33600_f_at HUMMHDRA1C MHC class II HLA-DR-beta-1 \(H av dif pos
M33680_at HUMTAPA1 26-kDa cell surface protein TA av dif pos
M33684_s_at HUMPPPB1A5 \(clone lambda-16-1\) non-rece av dif pos
M34041_at HUMADRA2RA alpha-2-adrenergic receptor \( av dif pos
M34182_at HUMPRKACG testis-specific protein kinase av dif pos
M34516_at HUMIGL122 omega light chain protein 14.1 av dif pos
M34516_r_at HUMIGL122 omega light chain protein 14.1 av dif pos
M34516_r_at HUMIGL122 omega light chain protein 14.1 av dif pos
M34715_at HUMPSBGAA pregnancy-specific beta-1-glyc av dif pos
M34996_s_at HUMDQA1A MHC cell surface glycoprotein av dif pos
M34996_s_at HUMDQA1A MHC cell surface glycoprotein av dif pos
M35198_at HUMINTB6A integrin B-6 mRNA. :integrin, av dif pos
M35252_at HUMCOOTAA CO-029. transmembrane 4 super av dif pos
M35878_at HUMIBP3 insulin-like growth factor-bin av dif pos
M36072_at HUMRPL7A ribosomal protein L7a \(surf 3 av dif pos
M37238_s_at HUMPLC phospholipase C mRNA, :phosph av dif pos
M37245_at HUMIGCTL3 Ig superfamily cytotoxic T-lym av dif pos
M37245_at HUMIGCTL3 Ig superfamily cytotoxic T-lym av dif pos
M37435_at HUMCSDF1 macrophage-specific colony-sti av dif pos
M37583_at HUMHIS2AZ histone \(H2A.Z\) mRNA, :hist av dif pos
M37815_cds1_at HUMCD284 T-cell membrane glycoprotein C av dif pos
M38449_s_at HUMTGFBA transforming growth factor-bet av dif pos
M38690_at HUMANTCD9 CD9 antigen mRNA, :CD9 antige av dif pos
M38690_at HUMANTCD9 CD9 antigen mRNA, :CD9 antige av dif pos
M54995_at HUMCTAP3 connective tissue activation p av dif pos
M55409_s_at HUMPANCAN pancreatic tumor-related prote av dif pos
M55409_s_at HUMPANCAN pancreatic tumor-related prote av dif pos
M57293_at HUMPTHRPA parathyroid hormone-related pe av dif pos
M57399_at HUMHBNF1 nerve growth factor \(HBNF-1\) av dif pos
M57399_at HUMHBNF1 nerve growth factor \(HBNF-1\) av dif pos
M57466_s_at HUMMHDPL MHC class II HLA-DP light chai av dif pos
M57466_s_at HUMMHDPL MHC class II HLA-DP light chai av dif pos
M57710_at HUMBPIGE IgE-binding protein \(epsilon- av dif pos
M58378_cds1_at HUMSYN1E13 synapsin I \(SYN1\) gene, exon av dif pos
M58525_s_at HUMCOMTC catechol-O-methyltransferase av dif pos
M58525_s_at HUMCOMTC catechol-O-methyltransferase av dif pos
M59216_s_at HUMGABRB1S5 gamma-aminobutyric acid-A \(GA av dif pos
M59371_at HUMECK protein tyrosine kinase mRNA, av dif pos
M59807_at HUMNK4 NK4 mRNA, :natural killer cel av dif pos
M59830_at HUMMHHSP2 MHC class III HSP70-2 gene \(H av dif pos
M59911_at HUMINTA3A integrin alpha-3 chain mRNA, av dif pos
M60483_rna1_s_at HUMPP2AA protein phosphatase 2A catalyt av dif pos
M60854_at HUMSRAA ribosomal protein S18 mRNA, : av dif pos
M61916_at HUMLAM101 laminin B1 chain mRNA, :lamin av dif pos
M62403_s_at HUMIGFBP5 insulin-like growth factor bin av dif pos
M62403_s_at HUMIGFBP5 insulin-like growth factor bin av dif pos
M62486_at HUMPRPC4S12 C4b-binding protein gene, exon av dif pos
M63256_at HUMCDR2AA major Yo paraneoplastic antige av dif pos
M63379_at HUMTRPM2A4 TRPM-2 protein gene, exons 7,8 av dif pos
M63438_s_at HUMIGGK Ig rearranged gamma chain mRNA av dif pos
M63438_s_at HUMIGGK Ig rearranged gamma chain mRNA av dif pos
M63573_at HUMSCYLP secreted cyclophilin-like prot av dif pos
M63589_at HUMSCL7 stem cell leukemia gene produc log neg
M64347_at HUMFGFLR novel growth factor receptor m log neg
M64347_at HUMFGFLR novel growth factor receptor m log neg
M64673_at HUMHSF1 heat shock factor 1 \(TCF5\) m log neg
M64716_at HUMRPS25 ribosomal protein S25 mRNA, : log neg
M64992_at HUMPROS30 prosomal protein P30-33K \(pro log neg
M65292_s_at HUMHAAA factor H homologue mRNA, :"fa log neg
M65292_s_at HUMHAAA factor H homologue mRNA, :"fa log neg
M69023_at HUMGGEFERA globingene. log neg
M69066_at HUMMOESIN moesin mRNA, :moesin :moesin log neg
M69238_at HUMARNTA aryl hydrocarbon receptor nucl log neg
M73077_at HUMGRF1A glucocorticoid receptor repres log neg
M73239_s_at HUMSCFA1 \(clone SF1\) hepatocyte growt log neg
M73547_at HUMPOLLA polyposis locus \(DP1 gene\) m log neg
M74093_at HUMCLNC cyclin mRNA. :cyclin E1 log neg
M74297_at HUMHOX14 homeobox 1.4 protein mRNA, :h log neg
M74715_s_at HUMIDNAL alpha-L-iduronidas \(IDUA\) mR log neg
M77232_rna1_at HUMRPS8B ribosomal protein S6 gene, com log neg
M77836_at HUMP5CR pyrroline 5-carboxylate reduct log neg
M80244_at HUME16GEN E16 mRNA, log neg
M80254_at HUMCYP cyclophilin isoform \(hCyP3\) log neg
M80359_at HUMP78A protein p78 mRNA, :MAP/microt log neg
M80563_at HUMCAPL CAPL protein mRNA, :S100 calc log neg
M80563_at HUMCAPL CAPL protein mRNA, :S100 calc log neg
M80899_at HUMAHNAKA novel protein AHNAK mRNA, part log neg
M81750_at HUMMCNDA myeloid cell nuclear different log neg
M81757_at HUMS19RP S19 ribosomal protein mRNA, log neg
M81883_at HUMGAD67A glutamate decarboxylase \(GAD6 log neg
M83181_at HUMHTRB serotonin receptor gene, :5-h log neg
M84424_at HUMCTSE09 cathepsin E \(CTSE\) gene, exo log neg
M84711_at HUMFTE1A v-fos transformation effector log neg
M85289_at HUMHSPG2B heparan sulfate proteoglycan log neg
M86400_at HUMPHPLA2 phospholipase A2 mRNA, :tyros log neg
M86699_at HUMTTK kinase \(TTK\) mRNA. :TTK pro log neg
M86737_at HUMHMGBP high mobility group box \(SSRP log neg
M87789_s_at HUMIGHEPAH \(hybridoma H210\) anti-hepati log neg
M87789_s_at HUMIGHEPAH \(hybridoma H210\) anti-hepati log neg
M90356_at HUMBTFD BTF3 protein homologue gene, log neg
M90656_at HUMGCSH gamma-glutamylcysteine synthet log neg

M91670_at HUME2EPI ubiquitin carrier protein \(E2 log neg
M94856_at HUMFABPHA fatty acid binding protein hom log neg
M94856_at HUMFABPHA fatty acid binding protein hom log neg
M94880_f_at HUMHLAAX MHC class I \(HLA-A*8001\) mRN log neg
M96233_s_at HUMGSTM4A glutathione transferase class log neg
M96233_s_at HUMGSTM4A glutathione transferase class log neg
M96326_rna1_at HUMAZCDI azurocidin gene, log neg
M96956_at HUMTDGF3A \(clone CR-3\) teratocarcinoma log neg
M97796_s_at HUMID2X helix-loop-helix protein \(Id- log neg
M97815_at HUMCRABP02 retinoic acid-binding protein log neg
S34389_at HMOX2 heme oxygenase-2 [human, kidne log neg
S58544_at SPAG1 75 kda infertility-related spe log neg
S69115_at S69115 granulocyte colony-stimulating log neg
S69115_at S69115 granulocyte colony-stimulating log neg
S71043_rna1_s_at S71043 Ig alpha 2=immunoglobulin A he log neg
S71043_rna1_s_at S71043 Ig alpha 2=immunoglobulin A he log neg
S73591_at VDUP1 brain-expressed HHCPA78 homolo log neg
S73591_at VDUP1 brain-expressed HHCPA78 homolo log neg
S75463_at S75463 P43=mitochondrial elongation f log neg
S77356_at S77356 transcript ch21=oligomycin sen log neg
S77582_at S77582 HERVK10/HUMMTV reverse transcr log neg
S78234_at S78234 nuc2 homolog [human, fibroblas log neg
S78771_s_at S78771 NAT=CpG island-associated gene log neg
S79219_s_at S79219 metastasis-associated gene [hu log neg
S79522_at S79522 ubiquitin carboxyl extension p log neg
S80562_at CNN3 acidic calponin [human, kidney log neg
S82297_at S82297 beta 2-microglobulin {11 bp del log neg
S82597_rna1_s_at S82597 UDP-GalNAc:polypeptide log neg
S90469_at POR cytochrome P450 reductase [hum log neg
U00947_s_at U00947 clone C4E 3.2 \(CAC\)n/\(GTG\) log neg
U03397_s_at U03397 receptor protein 4-1BB mRNA, log neg
U03398_at TNFSF9 receptor 4-1BB ligand mRNA, : log neg
U04241_at U04241 homolog of Drosophila enhancer log neg
U04313_at PI5 maspin mRNA, :protease inhibi log neg
U05340_at CDC20 p55CDC mRNA, :cell division c log neg
U06155_s_at U06155 chromosome 1q subtelomeric seq log neg
U06863_at U06863 follistatin-related protein pr log neg
U06863_at U06863 follistatin-related protein pr log neg
U09117_at PLCD1 phospholipase c delta 1 mRNA. log neg
U09303_at EFNB1 T cell leukemia LERK-2 \(EPLG2 log neg
U09813_at ATP5G3 mitochondrial ATP synthase sub log neg
U09953_at U09953 ribosomal protein L9 mRNA, :r log neg
U10362_at U10362 GP36b glycoprotein mRNA, log neg
U10492_at HSMOX1 Mox1 protein \(MOX1\) mRNA, : log neg
U12404_at U12404 Csa-19 mRNA, log neg
U12404_at U12404 Csa-19 mRNA, log neg
U12465_at U12465 ribosomal protein L35 mRNA, log neg
U12779_at U12779 MAP kinase activated protein k log neg
U14391_at MYO1C myosin-IC mRNA, :myosin IC log neg
U14588_at PXN paxillin mRNA, :paxillin :pax log neg
U14968_at U14968 ribosomal protein L27a mRNA. log neg
U14969_at U14969 ribosomal protein L28 mRNA, : log neg
U14970_at U14970 ribosomal protein S5 mRNA. :r log neg
U14971_at U14971 ribosomal protein S9 mRNA, :r log neg
U14971_at U14971 ribosomal protein S9 mRNA, :r log neg



WO 00/52204 



PCT/IB00/00367 



U14972_at U14972
U14973_at U14973
U15177_at U15177
U16660_at ECH1
U16799_s_at U16799
U16861_at KCNJ2
U17077_at BENE
U17760_rna1_at HSLAMB3S17
U19247_rna1_s_at HSINFGRA7
U19251_s_at NAIP
U20657_at USP4
U20734_s_at U20734
U20734_s_at U20734
U20758_rna1_at U20758
U22376_cds2_s_at MYB
U22431_s_at U22431
U22970_rna1_s_at U22970
U22970_rna1_s_at U22970
U24183_s_at U24183
U24389_s_at HSLY0XL7
U25789_at U25789
U27333_at U27333
U27333_at U27333
U27831_at U27831
U29175_at U29175
U29953_rna1_at PEDF
U30827_s_at U30827
U30888_at USP14
U30888_at USP14
U31814_at HDAC2
U31875_at HEP27
U32944_at PIN
U34880_at U34880
U36341_rna1_at U36341
U36764_at U36764
U37012_at U37012
U37146_at U37146
U37408_at CTBP1
U37689_at POLR2H
U38276_at SEMA3F
U38276_at SEMA3F
U39400_at C11orf4
U40998_at U40998
U41060_at U41060
U41766_s_at ADAM9
U42359_at HUMN33S10
U43328_at U43328
U43901_rna1_s_at U43901
U43901_rna1_s_at U43901
U45448_s_at U45448
U48705_rna1_s_at U48705
U48936_at U48936
U48936_at U48936
U49395_at U49395
U49869_rna1_at UBB



ribosomal protein S10 mRNA, log neg 

ribosomal protein S29 mRNA, : log neg 

cosmid CRI-JC2015 at D10S289 i log neg 

peroxisomal enoyl-CoA hydratas log neg 

Na.K-ATPase beta-1 subunrt mRN log neg 

inward rectifying potassium ch log neg 

BENE mRNA, :BENE protein log neg 

laminin S B3 chain \(LAMB3\) g log neg 

interferon-gamma receptor alph log neg 

neuronal apoptosis inhibitory log neg 

ubiquitin protease \(Unph\) pr log neg 

transcription factor junB \Qu log neg 

transcription factor junB \Qu log neg 

osteopontin gene, log neg 

\(omyb\) gene, complete prima log neg 

hypoxia-inducible factor 1 alp log neg 

interferon-inducible peptide log neg 

interferon-inducible peptide log neg 

phosphofructokinase \(PFKM\) m log neg 

lysyl oxkJase-iike protein gen log neg 

ribosomal protein L21 mRNA, : log neg 

alpha \(1 ,3\) fucosyltransfera log neg 

alpha \(1 ,3\) fucosyltransfera log neg 

striatunvenriched phosphatase log neg 

transcriptional activator \(BR log neg 

pigment epithelium-derived fac log neg 

splicing factor SRp40-3 \(SRp4 log neg 

tRNA-guanine transglycosylase log neg 

tRNA-guanine transglycosylase log neg 

transcriptional regulator homo log neg 

Hep27 protein mRNA, :short-ch log neg 

cytoplasmic dynein light chain log neg 
DPH2L mRNA', :DPH2L "mRNA," co log neg 

Xq28 cosmid, creatine transpor log neg 

TGF-beta receptor interacting log neg 

cleavage and polyadenylation s log neg 

silencing mediator of retinoid log neg 

phosphoprotein CtBP mRNA, :C- log neg 

RNA polymerase II subunrt \(hs log neg 

semaphorin III family homolog log neg 

semaphorin III family homotog log neg 
NOF1 mRNA, .chromosome 11 op log neg 

retinal protein \(HRG4\) mRNA, log neg 

breast cancer, estrogen regula log neg 

metailoprotease/disintegrin/cy tog neg 

N33 protein form 1 \(N33\) gen log neg 

link protein mRNA, log neg 

37 kD iaminin receptor precurs log neg 

37 kD laminin receptor precurs log neg 

P2x1 receptor mRNA, log neg 

receptor tyrosine kinase DOR g log neg 

amiloride-sensitive epithelial log neg 

amiloride-sensitive epithelial log neg 

ionotropic ATP receptor P2X5a log neg 

ubiquitin gene, :ubiquitin B log neg 
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U50523_at 




U50523 


U50929_at 




BHMT 


U52154_at 




KCNJ5 


U52154_at 




KCNJ5 


U52696_s_at 




U52696 


U53786_at 




U53786 


U55054_at 




HSKCC 


U55054_at 




HSKCC 


U57341_r_at 




U57341 


U57342_aT 




MLF2 


U57629_at 




RPGR 


U58682_at 




U58682 


U60975_at 




U60975 


U60975_at 




U60975 


U62739_at 




BCAT2 


U62962_at 




EIF3S6 


U63541_at 




U63541 


U64863_at 




PDCD1 


U66G61_cds3_at 


U66061 


U66406_at 




EFNB3 


UB6616_at 




SMARCC2 


U66616_at 




SMARCC2 


U67092_s_at 




U67092 


U67156_at 




MEKK5 


U68105_s_at 




HSPABPS1 3 


U70732_ma1. 


.at 


GPT 


U70867_at 




SLC21A2 


U73379_at 




U73379 


U73379_at 




U73379 


U73824 at 




EIF4G2 


U73843_at 




U73843 


U77456_at 




NAP1L4 


U77846_ma1, 


_at 


U77846 


U77846_ma1. 


_s_at 


U77846 


U77846_ma1. 


_s_at 


U77846 


U78027_ma3_at 


U78027 


U78095_at 




U78095 


U78678_at 




U78678 


U78722_at 




U78722 


U78735 at 




U78735 


U79256_at 




U79258 


U79280_at 




U79280 


U79299_at 




U79299 


U80184_ma1 




FLU 


U81984_at 




EPAS1 


U82169_at 




F2D9 


U82169_at 




FZD9 


U82818 at 




U82818 


U83248_at 




CPNE1 


U83598_at 




U83598 


U86136_at 




U86136 


U87972_at 




U87972 


U88964 at 




ISG20 


U89328_at 




U89326 


U90426_at 




DDXL 



BRCA2 region, mRNA sequence CG log neg 



betainerhomocysteine methyltra log neg 

G protein-coupled inwardly rec log neg 

G protein-coupled inwardly rec log neg 

adrenal Cretnp homolog \(Creb log neg 

envoplakin \(EVPL\) mRNA, :en log neg 

K-CI cotransporter V(hKCC1\) m log neg 

K-CI cotransporter \(hKCC1\) m log neg 

neurofilament triplet L protei log neg 

myelodysplasia/myeloid leukemi log neg 

retinitis pigmentosa GTPase re log neg 

ribosomal protein S28 mRNA, : log neg 

hybrid receptor gp250 precurso log neg 

hybrid receptor gp250 precurso log neg 

branched-chain amino acid amin log neg 

lnt-6 mRNA, reukaryotic trans log neg 

mRNA expressed in HC/HCC liver log neg 

hPD-1 \(hPD-1\) mRNA, :progra log neg 

germiine T-cell receptor beta log neg 

putative EPH-re!ated PTK recep log neg 

SWI/SNF complex 1 70 KOa subuni log neg 

SWI/SNF complex 170 KDa subuni log neg 

ataxia-telangiectasia locus pr log neg 

mitogen-activated kinase kinas log neg 

poly\(A\)-binding protein \(PA log neg 

glutamate pyruvate transaminas log neg 

prostaglandin transporter hPGT log neg 

cyclin-selective ubiquitin car log neg 

cyclin-selective ubiquitin car log neg 

p97 mRNA, :eukaryotic transla log neg 

epithelial-specific transcript log neg 

nucleosome assembly protein 2 log neg 

elastin gene, partial cds and log neg 

elastin gene, partial cds and log neg 

elastin gene, partial cds and log neg 

Bruton's tyrosine kinase \(BTK log neg 

placental bikunin mRNA, :Plac log neg 

thioredoxin mRNA, nuclear gene log neg 

zinc finger protein 165 \(Zpf1 log neg 

ABC3 mRNA, log neg 

clone 23719 mRNA sequence. log neg 

clone 23575 mRNA, log neg 

neuronal olfactomedin-retated log neg 

FLU gene, iflightless I \(Dr log neg 

endothelial PAS domain protein log neg 

frizzled homolog \(FZD3\) mRNA log neg 

frizzled homolog \(FZD3\) mRNA log neg 

UCP3S mRNA, log neg 

copine I mRNA, icopine I :cop log neg 

death domain receptor 3 solubl log neg 

telomerase-associated protein log neg 

NAD+-isocitrate dehydrogenase log pos 

HEM45 mRNA, tinterferon stimu log pos 

bone morphogenetic protein rec log pos 

nuclear RNA helicase, :nuclea log pos 
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U90552_s_at 


U90552 


outyropniiin \(o i roy mKwA, . 


Inn no*; 


U90913_at 


U90913 


clone 23665 mRNA sequence, .cj 


iuy poo 


U90916_at 


U90916 


clone 23o 1 5 mKNA sequence, .ci 


iuy puo 


U94747_at 


HAN11 


wu repeat protein man 1 i itiknm, 


Inn nnc 


U95740_ma1_at 


U95740 


onromosome 10 omu cione ui i wo/ 


Inn nnc 


V00571_ma1_at 


HSPCRF 


gene encoding prepro form of c 


Inn nnc 

log pus 


V00572lat 


HSPGK1 


mRNA encoding phosph ugly cerate 


log pos 


V00594_s_at 


HSTHIO 


metallothionein from cadmium- 


log pos 


V01512_rna1_at 


HSCFOS 


cellular oncogene ofos \(comp 


1 A/t MAC 

log pos 


X00274_at 


HSHL07 


gene for HLA-DR alpha heavy ch 


Inn nnc 

lug pus 


X00274_at 


HSHL07 


gene for HLA-DR alpha heavy ch 


Iam nnc 

log pos 


X00351_f_at 


HSAC07 


beta-actin. 


Iaai mac 

lug pos 


X00368_xpt2_at 


HSPROL1 


prolactin gene 5* region. 


log pos 


X01677J_at 


HSGAPDR 


liver glyceraldehyde-3-phosph 


log pos 


X02152_at 


HSLDHAR 


lactate denydrogenase-A hlum 


Inn nnc 

IUy puc> 


X02596_at 


HSBCRR 


bcr \(breakpoint cluster regl 


Inn nnc 

iuy puo 


X03068_f_at 


HSHLDQWB 


mla-u class ii antigen uuwi.i 


Inn nnc 

iuy pud 


X031 00_cds2_at 


HSHLASBA 


nLA-oo\(Urv aipna gene, .muv- 


Inn nnc 


X03100_cds2_at 


HSHLASBA 


ii 1 a CD\/nDV\ a | n ha nana -HI A. 

nLA-ob\(Up'v aipna gene, .mum- 


Inn nnc 
IUy pwo 


X03342_at 


HSRPL32 


noosomai protein loz\ .noos 


Inn nnc 
iuy uuo 


X03689_s_at 


HSEFTUR5 


mRNA fragment tor elongation t 


Inn nnc 

iuy puo 


X03689_s_at 


HSEFTUR5 


mRNA fragment for elongation t 


Inn nnc 
IUy pUo 


XD4347_s_at 


HSUPIR1 


liver mRNA fragment dna Dinotn 


Inn nnc 

log pus 


X04347_s_at 


HSUPIR1 


liver mRNA fragment DNA bindin 


log pos 


X06814_a7 


HSRRA 


receptor of retinoic acid. :r 


log pos 


X06617_at 


HSRPS11 


ribosomal protein S1 1 . : ribo 


log pos 


X06985_at 


HSOXYGR 


heme oxygenase. :heme oxygena 


Inn nnc 

lug DOS 


X06985_at 


HSOXYGR 


heme oxygenase. :heme oxygena 


log pos 


X07896_at 


HSKERC15 


^ . A. * AC ilin mltn 4 C ■ 

cytokeratin 15. :keratin 15 . 


Inn nnc 

log pus 


X07730_at 


HSPSA 


prostate specific antigen. : 


log pos 


X07730_at 


HSPSA 


prostate specific antigen. : 


log pos 


X12447_at 


HSALDOA 


aldolase A gene \(tc 4.1.2.13 


log pos 


X12671_ma1_at 


HSHNRNPA 


gene for heterogeneous nuclear 


rvn£> 

log pos 


X12671_ma1_at 


HSHNRNPA 


gene for heterogeneous nuclear 


log pos 


X12876_s_at 


HSKER18A 


mRNA fragment for cytokeratin 


log pos 


X1 2878_A.at 


HSKER18A 


mRNA fragment for cytokeratin 


log pos 


X13334_at 


HSCD14R 


CD14 myelid cell-specific leu 


log pos 


X13546_ma1_at 


HSHMG17G 


HMG-17 gene for non-histone ch 


log pos 


X13794_ma1jrt 


HSLDHB1 


lactate dehydrogenase B gene e 


Iam »*vy^ft^ 

log pos 


X13794_ma1_at 


HSLDHB1 


lactate dehydrogenase B gene e 


log pos 


X14008_ma1J_at 


HSLYSOZY 


lysozyme gene \(EC 3.2.1. 17\). 


Inn nnc 

log pus 


X15940_at 


HSRPL31 


nbosomal protein L31. .nbos 


lug pos 


X16064_at 


HSTUMP 


• - — ft — A * ^.IL . — A It Am Aftt All 

translationaliy controlled tu 


Inn nnc 

lug pos 


X16832_at 


HSCATHH 


cathepsin H \(EC 3.4.22.1 e\). 


Inn nnc 
IOy UUS 


X17042_at 


HSHPCP 


hematopoetic proteoglycan cor 


Inn nnc 
IUy UUS 


X17206_at 


HSLLREP3 


LLRep3. : LLRep3 


Inn nnc 
IUy UVS 


X51345__at 


HSJUNB 


jun-B JUN-B protein, jun o p 


Inn nnc 

log pus 


X5i4oo — at 


notrZ 


Alnnnatinn fartfw 2 • filonoa 


lOQ DOS 


X51688_at 


HSCYCUNA 


cyclinA. 


log pos 


X52003j>t 


HSPS2MKN 


pS2 protein gene. :trefoil fac 


log pos 


X52003_at 


HSPS2MKN 


pS2 protein gene. :trefoil fac 


log pos 


X52426_s_at 


HSCYTK 


cytokeratin 13. : cytokeratin 


log pos 


X52426_s_at 


HSCYTK 


cytokeratin 13. : cytokeratin 


log pos 


X52851_ma1_at 


HSCPH70 


cyclophilin gene for cydophil 


log pos 


X52966_at 


HSL35A 


ribosomal protein L35a. :ribo 


log pos 



axt- 
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X53586_ma1_at HSINTA6R integrin alpha 6. :integrin, log pos 

X53587~at HSINTB4R Integrin beta 4. : integrin b log pos 

X537rTat HSL23MR L23 putative ribosomal protel log pos 

X54232_at HSGLYPIC heparan sulfate proteaglycan log pos 

X54687_at HSCYSTATS cystatin S. log pos 

X54942~at HSCKSHS2 ckshs2 Cks1 protein homologue log pos 

X54©42lat HSCKSHS2 ckshs2 Cks1 protein homologue log pos 

X55005_ma1_at HSCERBAR oerbA-1 thyroid hormone rece log pos 

X55715lat HSHUMS3 Hums3 40S ribosomal protein s log pos 

X55954_at HSL17ARP HL23 ribosomal protein homolo log pos 

X56494_at HSPKM12 M gene for M1 -type and M2-type log pos 

X56687_s_at HSAUTNOR autoantigen NOR-90. log pos 

X56807laT HSDGII DSC2 desmocollins type 2a and log pos 

X5684llat HSHLAE HLA-E gene. :major histocompat log pos 

X56932_at HS23KDHBP 23 kD highly basic protein. log pos 

X57351_at HS1 8D 1-8D gene from interferon-indu log pos 

X5735llat HS1 8D 1-8D gene from interferon-indu log pos 

X57351~s_at HS1 8D 1-8D gene from interferon-indu log pos 

X57809_slat HSIGVL009 rearranged Immunoglobulin lamb log pos 

X578oVY]at HSIGVL009 rearranged immunoglobulin lamb log pos 

X57959~a7 HSRBPRL7A ribosomal protein L7. iriboso log pos 

X58072_at HSGATA3R hGATA3 trans-acting T-cell sp log pos 

X59373_at HSHOX4D HOX4D a homeobox protein. :ho log pos 

X59798lat HSPRAD1CY PRAD1 cyclin. :PRAD1 cyclin log pos 

X60489lat HSEF1B elongation factor-1 -beta. log pos 

X61587_at HSRHOG rftoG GTPase. :ras homolog gen log pos 

X6232o"at HSEPITt epithelin 1 and 2. : epitheli log pos 

X62466~at HSCAMPAT1 CAMPATH-1 \(CDw52\) antigen. log pos 

X62466lat HSCAMPAT1 CAMPATH-1 \(CDw52\) antigen. log pos 

X62654lma1_at HSMECDAG gene for Me491/CD63 antigen. : log pos 

X6269llat HSRPRNA ribosomal protein \(homologuo log pos 

X63359_at HSUGT2BIO UGT2BIO udp glucuronosyltrans log pos 

X63527_at HSRPL19 ribosomal protein L19. :ribos log pos 

X63629_at HSPCAD p cadherin. :cadherin 3, P-ca log pos 

X84229 at HSDEK9 dek mRNA. :DEK gene log pos 

X64707jit HSBBC1 BBC1 mRNA. log pos 

X856lOt HSS100PCB calcium-binding protein S1 OOP log pos 

X661 1 4~ma1_at HS20XOC gene for 2-oxoglutarate came log pos 

X66363~at HSSTHPKD mRNA PCTAIRE-1 for serine/thre log pos 

X66363lat HSSTHPKD mRNA PCTAIRE-1 for serine/thre log pos 

X66899_at HSEWS EWS mRNA. :Ewing sarcoma break log pos 

X67247~ma1_at HSRPS8 rpS8 gene for ribosomal protei log pos 

X67325lat HSP27 p27 mRNA. :interferon t alpha-i log pos 

X67951~at HSPAG proliferation-associated gene log pos 

X68314~at HSGPGI glutathione peroxidase-GI. :g log pos 

X68314~at HSGPGI glutathione peroxidase-GI. :g log pos 

X68688lma1_s_at HSZNB ZNF33B gene. log pos 

X69150lat HSRPS18 ribosomal protein S18. :ribos log pos 

X6939llat HSRPL6AA ribosomal protein L6. iriboso log pos 

X69550_at HSRH01 rho GDP-dissociation Inhibito log pos 

X69654lat HSS26 ribosomal protein S26. log pos 

X7094uCs_at HSEFAC1A2 elongation factor 1 alpha-2. log pos 

X70940ls_at HSEFAC1A2 elongation factor 1 alpha-2. log pos 

X73079_a7 HSPIR encoding Polymeric immunoglobu log pos 

X73358~s_at HSAES1 hAES-1 mRNA. ramino-terminal e log pos 
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CLAIMS 

1. A method of determining an expression pattern of a cell sample
independent of the proportion of submucosal, smooth muscle,
or connective tissue cells present, comprising:

determining expression of one or more genes in a sample comprising
cells, wherein the one or more genes exclude genes which are expressed in the
submucosal, smooth muscle, or connective tissue, whereby a pattern of
expression is formed for the sample which is independent of the proportion of
submucosal, smooth muscle, or connective tissue cells in the sample.

2. The method of claim 1 wherein the sample comprises epithelial
or carcinoma cells.

3. The method of claim 2 wherein the sample comprises urothelial
or bladder cancer cells.
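
By way of illustration only, the exclusion step recited in claim 1 might be sketched as follows. The sketch assumes that expression levels are held in a dictionary keyed by gene identifier and that a set of genes known to be expressed in submucosal, smooth muscle, or connective tissue is already available. The function name `exclude_stromal_genes`, the gene identifiers, and the numeric values are hypothetical placeholders and are not taken from the specification.

```python
def exclude_stromal_genes(pattern, stromal_genes):
    """Return the expression pattern restricted to genes that are NOT
    expressed in submucosal, smooth muscle, or connective tissue."""
    return {gene: level for gene, level in pattern.items()
            if gene not in stromal_genes}


# Hypothetical data: identifiers and levels are placeholders only.
sample_pattern = {"GENE_A": 120.0, "GENE_B": 45.0, "GENE_C": 300.0}
stromal_genes = {"GENE_B"}

filtered_pattern = exclude_stromal_genes(sample_pattern, stromal_genes)
print(filtered_pattern)
```

Because the excluded genes never enter the pattern, a change in the amount of stromal tissue in the sample cannot alter the values that remain, which is the sense in which the pattern is independent of that proportion.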

4. A method of determining an expression pattern of a cell sample,
comprising:

determining expression of one or more genes in a sample comprising
cells, whereby a first pattern of expression is formed for the sample;

removing expression of genes which are expressed in submucosal,
smooth muscle, or connective tissue cells from the first pattern of expression,
whereby a second pattern of expression is formed, wherein the second pattern
is independent of the proportion of submucosal, smooth muscle, or connective
tissue cells in the sample.

5. The method of claim 4 wherein the cell sample is an epithelium
or carcinoma sample.

6. The method of claim 5 wherein the cell sample is a urothelium
or bladder cancer sample.

7. A method for determining an expression pattern of a urothelium
or bladder cancer cell, comprising:

determining expression of one or more genes in a sample comprising
urothelium or bladder cancer cells, whereby a first pattern of expression is
formed;

subtracting from the first pattern of expression a second pattern of
expression, wherein the second pattern was formed using the one or more
genes and a sample comprising predominantly submucosal, smooth muscle, or
connective tissue cells, said step of subtracting forming a third pattern of
expression which reflects expression of the urothelium or bladder cancer cells
independent of the proportion of submucosal, smooth muscle, or connective
tissue cells present in the sample.
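
The subtracting step of claim 7 could, as one non-limiting sketch, be carried out gene by gene. The code below assumes both patterns are dictionaries of expression levels on the same scale. The `weight` parameter and the flooring of results at zero are assumptions introduced for this illustration, and all identifiers and values are hypothetical.

```python
def subtract_pattern(first, second, weight=1.0):
    """Subtract a reference pattern (formed from predominantly stromal
    tissue) from a sample pattern, gene by gene.

    Genes absent from the reference are treated as having zero stromal
    expression. Results are floored at zero (an assumption of this sketch)
    so that no gene is reported with a negative level.
    """
    return {gene: max(level - weight * second.get(gene, 0.0), 0.0)
            for gene, level in first.items()}


# Hypothetical first pattern (biopsy) and second pattern (stromal reference).
tumor_biopsy = {"GENE_A": 200.0, "GENE_B": 80.0, "GENE_C": 50.0}
stromal_reference = {"GENE_A": 20.0, "GENE_B": 100.0}

third_pattern = subtract_pattern(tumor_biopsy, stromal_reference)
print(third_pattern)
```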

8. A method of detecting an invasive tumor in a patient,
comprising:

detecting in a sample of a body fluid a marker which is more prevalent
in submucosal, smooth muscle, or connective tissue than in the body fluid,
wherein the marker is an mRNA or protein expression product of a gene,
wherein an increased amount of the marker in the body fluid indicates a tumor
which has become invasive in the patient.

9. The method of claim 8 wherein the body fluid is selected from
the group consisting of blood, plasma, serum, urine, ascites
fluid, pleural fluid, spinal fluid, sputum, and mucous secretions.

10. The method of claim 8 wherein the marker is a protein
characteristic of submucosal, smooth muscle, or connective
tissue, but not found in the body fluid.

11. A method to diagnose a bladder cancer comprising:

determining a first pattern of expression of one or more genes in a
bladder tissue sample suspected of being neoplastic;

comparing the first pattern of expression to a second and third
reference pattern of expression, wherein the second pattern is of the one or
more genes in normal urothelium and the third pattern is of the one or more
genes in bladder cancer, wherein a first pattern of expression which is more
similar to the third pattern than the second indicates neoplasia of the bladder
tissue sample.
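
As a hedged illustration of the comparison in claim 11, the sketch below treats a pattern as more similar to a reference when its Euclidean distance to that reference is smaller. The claim does not prescribe a similarity measure, so the choice of Euclidean distance is an assumption of this example, as are the function names and all data values.

```python
import math


def euclidean_distance(a, b):
    """Distance between two patterns over the genes they have in common."""
    genes = sorted(set(a) & set(b))
    return math.sqrt(sum((a[g] - b[g]) ** 2 for g in genes))


def indicates_neoplasia(sample, normal_ref, cancer_ref):
    """True when the sample is closer to the bladder cancer reference
    than to the normal urothelium reference."""
    return (euclidean_distance(sample, cancer_ref)
            < euclidean_distance(sample, normal_ref))


# Hypothetical reference patterns and test sample.
normal_ref = {"GENE_A": 1.0, "GENE_B": 5.0}
cancer_ref = {"GENE_A": 6.0, "GENE_B": 1.0}
sample = {"GENE_A": 5.0, "GENE_B": 2.0}

result = indicates_neoplasia(sample, normal_ref, cancer_ref)
print(result)
```
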
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12. A method to predict outcome or prescribe treatment of a 
bladder tumor, comprising: 

determining a first pattern of expression of one or more genes in a
bladder tumor sample;

comparing the first pattern to one or more reference patterns of
expression determined for bladder tumors at grades I to IV;

determining which of the reference patterns shares maximum similarity
with the first pattern, wherein the outcome or treatment appropriate for the
grade of tumor of the reference pattern with the maximum similarity is assigned
to the bladder tumor sample.

13. A method to determine grade of a bladder tumor, comprising:

determining a first pattern of expression of one or more genes in a
bladder tumor sample;

comparing the first pattern to one or more reference patterns of
expression determined for bladder tumors at grades I to IV;

determining which of the reference patterns shares maximum similarity
with the first pattern, wherein the grade of the reference pattern with the
maximum similarity is assigned to the bladder tumor sample.

14. A method to determine stage of a bladder tumor, comprising:

determining a first pattern of expression of one or more genes in a
bladder tumor sample;

comparing the first pattern to one or more stage-specific reference
patterns;

determining which of the stage-specific reference patterns shares
maximum similarity with the first pattern, wherein the stage of the reference
pattern with the maximum similarity is assigned to the bladder tumor sample.

15. The method of claim 14 wherein the pattern of expression of
the bladder tumor sample and the reference patterns comprise
data on the expression of one or more genes selected from the
group consisting of TCC-related genes, bladder papilloma-related
genes, and invasive TCC-related genes.
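
The maximum-similarity assignment common to claims 12 to 14 may be sketched as below. Pearson correlation is used as the similarity measure purely as an assumption of this example, since the claims leave the measure open. The labels, gene identifiers, and values are hypothetical.

```python
import math


def pearson(a, b):
    """Pearson correlation between two patterns over their shared genes.
    Assumes at least two shared genes and non-constant values."""
    genes = sorted(set(a) & set(b))
    xs = [a[g] for g in genes]
    ys = [b[g] for g in genes]
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


def assign_category(sample, references):
    """Return the label (grade or stage) of the reference pattern that
    shares maximum similarity with the sample pattern."""
    return max(references, key=lambda label: pearson(sample, references[label]))


# Hypothetical grade-specific reference patterns and a tumor sample.
references = {
    "grade I": {"GENE_A": 1.0, "GENE_B": 2.0, "GENE_C": 3.0},
    "grade III": {"GENE_A": 3.0, "GENE_B": 2.0, "GENE_C": 1.0},
}
sample = {"GENE_A": 2.5, "GENE_B": 2.0, "GENE_C": 1.0}

assigned = assign_category(sample, references)
print(assigned)
```

The same function serves for stage assignment when the reference dictionary is keyed by stage rather than grade.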

16. A method of identifying a tissue sample as urothelial, 
comprising: 

determining a first pattern of expression of one or more genes in a 
tissue sample; 

comparing the first pattern of expression to a second pattern of 
expression obtained from normal urothelial cells; wherein similarity between 
the first and second patterns identifies the tissue sample as urothelial in its 
origin. 

17. The method of claim 16 further comprising the step of: 
comparing the first pattern to one or more third patterns of expression 
obtained from other cell types, wherein differences between the first 
and third patterns confirm the suggestion that the tissue sample is
urothelial in origin. 

18. A method to identify a set of genes useful for diagnosing, 
predicting outcome, or prescribing treatment of a bladder 
cancer comprising: 

determining a first pattern of expression of one or more genes in a first 
bladder tissue sample; 

determining a second pattern of expression of the one or more genes 
in a second bladder tissue sample, wherein the first bladder tissue sample is a 
normal urothelium sample or an earlier stage or lower grade of bladder tumor 
than the second bladder tissue sample;

comparing the first pattern of expression to the second pattern of 
expression to identify a first set of genes whose expression is increased or 
decreased in the second bladder tissue sample relative to the first bladder tissue 
sample; 

removing from the first set of genes those genes which are expressed 
in submucosal, smooth muscle or connective tissue to produce a second set of 
genes, wherein measurement of expression of the second set of genes can be 
used for diagnosing, predicting outcome, or prescribing treatment of a bladder 
cancer. 
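
One possible, non-limiting reading of the gene-selection steps of claim 18 is sketched here. A gene is taken to be increased or decreased when its level differs between the two samples by at least a chosen fold change. The threshold `min_fold`, the function names, and all data are assumptions of this illustration rather than values taken from the specification.

```python
def differential_genes(first, second, min_fold=2.0):
    """Genes whose expression is increased or decreased in the second
    sample relative to the first by at least `min_fold`."""
    selected = set()
    for gene in set(first) & set(second):
        low, high = sorted((first[gene], second[gene]))
        if low > 0 and high / low >= min_fold:
            selected.add(gene)
    return selected


def candidate_marker_set(first, second, stromal_genes, min_fold=2.0):
    """First set of differential genes, minus genes expressed in
    submucosal, smooth muscle, or connective tissue (the second set)."""
    return differential_genes(first, second, min_fold) - set(stromal_genes)


# Hypothetical normal urothelium and tumor patterns.
normal = {"GENE_A": 10.0, "GENE_B": 10.0, "GENE_C": 10.0}
tumor = {"GENE_A": 40.0, "GENE_B": 2.0, "GENE_C": 12.0}

markers = candidate_marker_set(normal, tumor, stromal_genes={"GENE_B"})
print(markers)
```
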
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19. A method of determining an expression pattern of a bladder 
tissue sample independent of the proportion of submucosal, 
smooth muscle, or connective tissue cells present, comprising: 

isolating a single-cell suspension of disaggregated bladder tumor cells
from a bladder tissue sample comprising bladder cells, and cells of one or more
of a cell type selected from the group consisting of submucosal cells, smooth
muscle cells, or connective tissue cells;

determining expression of one or more genes in the single-cell
suspension, whereby a pattern of expression is formed for the sample which is
independent of the proportion of submucosal, smooth muscle, or connective
tissue cells in the bladder tissue sample.

20. The method of any of claims 1-19 wherein expression of a gene
is determined by assaying for an mRNA transcribed from the
gene or a protein translated from an mRNA transcribed from
the gene.

21. The method of any of claims 1-19 wherein expression of a
plurality of genes is determined.

22. A method of screening for candidate therapeutic agents for
treating bladder cancer, comprising the steps of:

contacting bladder tumor cells with a test compound;

determining gene expression of one or more genes in the bladder tumor
cells which have been contacted with the test compound, wherein expression
of the one or more genes changes during the development of a bladder cancer;

identifying a test compound as a candidate therapeutic agent if it causes
gene expression of at least one of the one or more genes to change to a level
which is characteristic of an earlier stage of cancer progression.
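
The identifying step of claim 22 might be sketched as follows, on the assumption that a level "characteristic of an earlier stage" can be expressed as lying within a relative tolerance of an early-stage reference level. The tolerance, the function name `is_candidate`, and the data are hypothetical and serve only to illustrate the logic.

```python
def is_candidate(untreated, treated, early_stage_ref, tolerance=0.2):
    """True if, for at least one gene, the test compound moved expression
    from outside to inside the tolerance band around the level that is
    characteristic of an earlier stage of cancer progression."""
    for gene, target in early_stage_ref.items():
        if gene not in untreated or gene not in treated:
            continue
        band = tolerance * abs(target)
        was_outside = abs(untreated[gene] - target) > band
        now_inside = abs(treated[gene] - target) <= band
        if was_outside and now_inside:
            return True
    return False


# Hypothetical early-stage reference and tumor cell measurements.
early_stage_ref = {"GENE_A": 10.0, "GENE_B": 50.0}
untreated = {"GENE_A": 40.0, "GENE_B": 5.0}
treated_hit = {"GENE_A": 11.0, "GENE_B": 5.0}
treated_miss = {"GENE_A": 38.0, "GENE_B": 6.0}

print(is_candidate(untreated, treated_hit, early_stage_ref))
print(is_candidate(untreated, treated_miss, early_stage_ref))
```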

23. A method of categorizing a tumor, comprising the steps of:

mixing cells of a plurality of tumors, wherein the tumors are of a single
type and of a similar stage or grade to form a pool;

determining expression of one or more genes in the pool;

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comparing expression of the one or more genes in the pool to 
expression in a test sample derived from a tumor, wherein similarities between 
the test sample expression and the pool expression permit categorization of the 
tumor. 

24. A method of categorizing a tumor, comprising the steps of:

mixing one or more gene products from cells of a plurality of
tumors, wherein the tumors are of a single type and of a similar stage or grade
to form a pool, wherein the gene product is mRNA or protein;

determining expression of one or more genes in the pool by
assaying the gene product in the pool;

comparing expression of the one or more genes in the pool to 
expression in a test sample derived from a tumor, wherein similarities between 
the test sample expression and the pool expression permit categorization of the 
tumor. 

25. A method of categorizing a tumor, comprising the steps of:

determining expression of one or more genes in a plurality of
tumors, wherein the tumors are of a single type and of a similar stage or grade;

combining data determined for the expression of the one or
more genes to form a data pool;

comparing expression of the one or more genes in the data pool
to expression in a test sample derived from a tumor, wherein similarities
between the test sample expression and the data pool permit categorization of
the tumor.

26. The method of claim 23, 24, or 25 wherein the tumors are
bladder tumors.
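
The data pool of claim 25 could be formed, in one illustrative and non-limiting sketch, by averaging the expression determined for each gene across the individual tumors. Averaging and the mean absolute difference used for the comparison are assumptions of this example, and the function names and values are hypothetical.

```python
from statistics import mean


def build_data_pool(tumor_patterns):
    """Combine per-tumor expression data into one pooled pattern by
    averaging each gene measured in every tumor."""
    shared = set.intersection(*(set(p) for p in tumor_patterns))
    return {gene: mean(p[gene] for p in tumor_patterns) for gene in shared}


def mean_abs_difference(sample, pool):
    """Average absolute difference between a test sample and the pool
    over the genes they share; smaller values mean greater similarity."""
    shared = set(sample) & set(pool)
    return mean(abs(sample[g] - pool[g]) for g in shared)


# Hypothetical tumors of a single type and similar grade.
tumors = [
    {"GENE_A": 10.0, "GENE_B": 20.0},
    {"GENE_A": 30.0, "GENE_B": 40.0},
]
data_pool = build_data_pool(tumors)

test_sample = {"GENE_A": 22.0, "GENE_B": 27.0}
difference = mean_abs_difference(test_sample, data_pool)
print(data_pool, difference)
```
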
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Fig. 2 was missing at the time of publication 
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FIG.7A 
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FIG.7C 
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FIG.8 
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Fig. 11 was missing at the time of publication 



